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Abstract 



A simple random walk on a graph is a sequence of movements from one vertex to 
another where at each step an edge is chosen uniformly at random from the set of 
edges incident on the current vertex, and then transitioned to next vertex. Central 
to this thesis is the cover time of the walk, that is, the expectation of the number of 
steps required to visit every vertex, maximised over all starting vertices. In our first 
contribution, we establish a relation between the cover times of a pair of graphs, and 
the cover time of their Cartesian product. This extends previous work on special cases 
of the Cartesian product, in particular, the square of a graph. We show that when 
one of the factors is in some sense larger than the other, its cover time dominates, and 
can become within a logarithmic factor of the cover time of the product as a whole. 
Our main theorem effectively gives conditions for when this holds. The techniques and 
lemmas we introduce may be of independent interest. In our second contribution, we 
determine the precise asymptotic value of the cover time of a random graph with given 
degree sequence. This is a graph picked uniformly at random from all simple graphs 
with that degree sequence. We also show that with high probability, a structural 
property of the graph called conductance, is bounded below by a constant. This is of 
independent interest. Finally, we explore random walks with weighted random edge 
choices. Wc present a weighting scheme that has a smaller worst case cover time than 
a simple random walk. We give an upper bound for a random graph of given degree 
sequence weighted according to our scheme. We demonstrate that the speed-up (that 
is, the ratio of cover times) over a simple random walk can be unbounded. 
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Chapter 1 
Introduction 



Let G = {V, E) be a finite, connected, undirected graph. Suppose we start at 
time step t = on some vertex u E V and choose an edge e uniformly at random 
(uar) from those incident on u. We then transition e to the vertex that the 
other end of e is incident on. We repeat this process at the next step, and so 
on. This is known as a simple random walk (often abbreviated to random walk) 
on G. We shall denote it by W„, where the subscript is the starting vertex. We 
write Wu(t) = a; if the walk is at vertex x at time step t. 

Immediately, a number of questions can be asked about this process. For exam- 
ple, 

(1) Does Wu visit every vertex in G7 

(2) If so, how long does it take on average? 

(3) On average, how long does it take to visit a particular vertex vl 

(4) On average, how long does it take to come back to itself? 

(5) In the long run, do all vertices get visited roughly the same number of times, 

or are there differences? 
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(6) If there are differences, what is the proportion of the time spent at a par- 

ticular vertex v in the long run? 

(7) How do the answers to the above questions vary if we change the starting 

vertex? 

(8) How do the answers to the above questions vary for a different graph? 

This thesis addresses all of these questions in one way or another for specific 
classes of graphs. However, the particular question that is the central motivation 
for this work is the following: 

For a random walk Wu on a simple, connected, undirected graph G = {V,E), 
what is the expected number of steps required to visit all the vertices in G, max- 
imised over starting vertices u ? 

The following quantities, related to the questions above, are formally defined in 
chapter |2} The expected time it takes Wu to visit every vertex of G is the cover 
time from u, CO\u[G], and the cover time COV[G] = max^ COVufG]. The 
expected time it takes W„ to visit some v is the hitting time li[u,v], and when 
V = u, it is called the first return time. 

These questions, much hke the process itself, are easy to understand, yet they and 
many others have been been the focus of a great deal of study in the mathematics 
and computer science communities. Some questions are easy to answer with 
basic probability theory, others are more involved and seem to require more 
sophisticated techniques. The difficulty usually varies according to what kind 
of answer we are looking for. For example, for the 2-dimensional torus with 
N = vertices, COV[Z^] = 0(A^log^ A^) is not too difficult to show with some 
of the theory and techniques we present in chapters |4] and |5| However, it was 
not until quite recently that a precise asymptotic result of COV[Z^] ~ ^ log^ N 
was given by |29j . 

This thesis is concerned primarily with cover time. 
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1.1 Applications 



1.1 Applications 

Before we give an outline of the thesis, we mention the applications of random 
walks and in particular, cover times. Applications are not the focus of this thesis, 
but it is worth mentioning their role, particularly in algorithmic and networking 
areas. The classical application of random walks in an algorithmic context is 
a randomised s — t connectivity algorithm. The problem, known as the s-t 
connectivity problem, is as follows: Given a graph G = {V, E), with |y | = n and 
\E\ = m, and two vertices s,t & V, if there is a path in G connecting s and 
t, return "true" otherwise, return "false". This can be done in time 0{n + m) 
with, for example, breadth first search. However, the space requirement is Q{n) 
for such an algorithm (or various others, such as depth first search). Take, for 
example, the case where G is a path of length n and s and t are ends of the path. 

With a random-walk based algorithm, we can present a randomized algorithm 
for the problem that requires 0(log?T,) space. It relies upon the following propo- 
sition, proved in chapter |5} See, e.g., [HI]. 

Proposition 1. For any connected, finite graph G = {V,E), CO\[G] < 4|V"||i?|. 

To avoid confusion with the name of the problem, we shall use the variable r to 
stand for time in the walk process. 

The algorithm is as follows: Start a walk Wg on G from vertex s. Assume time 
r = at the start of the walk. Stop the walk at time r if (i)r = 8nm or (ii) 
Ws(r) = t. If Ws(r) = t, then output "true". Otherwise output "false". 

Observe, if the algorithm returns "true", then it is correct, since there must be 
a path. If it returns "false", then it may be wrong, since it may have simply 
not visited t even though it could have. The question is, what is the probability 
that the algorithm returns an incorrect answer given that an s — t path exists? 
Suppose the random variable X counts the number of steps the random walk 
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1.2 Overview of the Thesis 



from s takes before visiting t. By Markov's inequahty, 

Pr(X > T) < 

Now, E[X] = H[s,t] and H[s,t] < COV[G]. So using Proposition [l| Pr(X > 
T) < 4nm/T, hence Pr(X > 8nm) < 1/2. 

The size of the input, that is, the graph, is n vertices and m < edges. This 
algorithm needs 0(logr;,) space since it need only store enough bits to keep track 
of its position and maintain a counter r. 

In fact, a breakthrough 2004 paper [1)5] showed that the problem can be solved 
by a deterministic algorithm using O(logn) space. Nevertheless, the randomised 
algorithm demonstrated here, as well as being very simple, remains a strong 
example of the role that the theory of random walks plays in applications. 

There are many other applications of random walks, particularly in networks and 
distributed systems, where they have been applied to self-stabilization ([24j), sen- 
sor networks ([SZ]), peer-to-peer networks [2S]) and voting ([26J), amongst 
many others. 

1.2 Overview of the Thesis 

Roughly speaking, this thesis can be divided into two parts. Aside from this 
chapter, chapters [2| |3j |4] and |5] are drawn from the established literature. They 
provide a review of some results and vital theory required for the original contri- 
bution. The original contribution can be considered to be in chapters [6| [7] and 

El 
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1.2 Overview of the Thesis 



1.2.1 Background 

In chapter [2| we give definitions and basic lemmas for graphs, random walks and 
Markov chains. We also give definitions of weighted random walks. These differ 
from simple random walks in how the next edge to be transitioned is chosen. 
Rather than choosing edges uar the probabilities are weighted by a weight 
assigned to the edge. This is more general than simple random walks, which 
are weighted random walks in which all edge weights are the same. Weighted 
random walks are the subject of chapter [8] 

Simple random walks on graphs are special cases of weighted random walks, 
which are in turn, special cases of Markov chains. Markov chains play a part 
in a fundamental lemma of chapter [6| and we use theorems from the literature 
based on Markov chains in chapters [7] and |8j In chapter |3| we give an account 
of the theory of Markov chains and random walks relevant to our work. Much 
of the theory presented in the framework of Markov chains is vital to sections 
of the thesis where only simple random walks are considered, and would have 
had to be written in a similar form had the more general presentation not been 
given. This, in conjunction with our use of Markov chains in various parts is why 
we chose to give a presentation in terms of Markov chains supplemented with 
explanations of how the general theory specialises for random walks. We also 
demonstrate a characterisation of Markov chains that are equivalent to weighted 
random walks. 

In chapter |4] we present the electrical network metaphor of random walks on 
graphs. This is a framework in which a theory has been built up to describe 
properties and behaviours of random walks in a different way. It provides a 
means of developing an intuition about random walks, and provides a tool kit 
of useful lemmas and theorems. Much of chapter |6] is built on the material in 
chapter |4j In chapter [7], the tools of electrical network theory are exploited in a 
number of proofs. 
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1.2 Overview of the Thesis 



In chapter [5] we present detailed proofs of hitting and cover times for specific, 
simple graph structures. We then give general techniques for bounding these 
parameters, including some we use in our proofs. We then give some bounds 
from the literature. 

1.2.2 Original Contribution 

In chapter [6] we present the first section of our original contribution. We study 
random walks on the Cartesian product F of a pair of graphs G and H. We 
refer the reader to chapter |6] for a definition of the Cartesian product. After 
giving definitions and context (including related work from the literature), we 
describe a probabilistic technique which we use to analyse the cover time. We 
then present a number of lemmas relating to the effective resistance of products 
of graphs. We apply the above to the problem of the cover time, and develop 
bounds on the cover time of the product F in terms of properties of the factors 
G and H. The resulting theorem can be used to demonstrate when the cover 
time of one of the factors dominates the other, and becomes of the same order 
or within a logarithmic factor of the cover time of the product as a whole. The 
probabilistic technique we introduce and the effective resistance lemmas may be 
of independent interest. This chapter is based on joint work with Colin Cooper 
and Tomasz Radzik, published in [2j. 

In chapter [7| we give a precise asymptotic result for the cover time of a random 
graph with given degree sequence d. That is, if a graph G on n vertices is 
picked uniformly at random from the set Q (d) of all simple graphs with vertices 
having pre-specified degrees d, then with high probability (whp), the cover 
time tends to a value r that we present. The phrase with high probability means 
with probability tending to 1 as tends to oo. After giving an account of the 
necessary theory, we give a proof that a certain structural property of graphs, 
known as the conductance, is, whp, bounded below by a constant for a G chosen 
uar from ^(d). This allows us to use some powerful theory from the literature 
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1.2 Overview of the Thesis 



to analyse the problem. We continue our analysis with further study of the 
structural properties of the graphs, and the behavior of random walks on them. 
We use these results to show that whp, no vertex is unvisited by time r + e, 
where e is some quantity that tends to 0. For the lower bound, we show that at 
time r — e, there is at least one unvisited vertex, whp. This chapter is based on 
joint work with Colin Cooper and Alan Frieze, published in [1]. 

Finally, in chapter |8| we investigate weighted random walks. Graph edges are 
given non-negative weights and the probability that an edge e is transitioned 
from a vertex u is w{e)/w{u), where w{e) is the weight of e and w{u) is the 
total weight of edges incident on u. We present from the literature a weighting 
scheme that has a worst case cover time better than a simple random walk. We 
then present our own weighting scheme, and show that it also has this property. 
We give an upper bound for the cover time of a weighted random walk on a 
random graph of given degree sequence weighted according to our scheme. We 
demonstrate that the speed-up (that is, the ratio of cover times) over a simple 
random walk can be unbounded. This chapter is based on joint work with Colin 
Cooper. 
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Chapter 2 



Definitions and Notation 



2.1 Graphs 

A graph G is a tuple {V,E), where the vertex set F is a set of objects called 
vertices and the edge set E is a. set of two-element tuples {u, v) or two-element 
sets {u, v} on members of V. The members of E are called edges. Graphs can be 
directed or undirected. In a directed graph the tuple {u, v) is considered ordered, 
so {u,v) and {v,u) are two different edges. In an undirected graph, in line with 
the conventions of set notation, the edge {u, v} can be written as {v, u}. How- 
ever, in the standard conventions of the literature, edges of undirected graphs 
are usually written in tuple form, and tuples are considered unordered. Thus, 
the edge {u,v} is written {u,v), and is included only once in E. We shall use 
this convention throughout most of the thesis, and be explicit when departing 
from it. Furthermore, we may use the notation u & G and e e G to stand for 
u eV and e E E respectively. 

In this thesis, we deal only with finite graphs, that is, both V and E are finite, 
and we shall sometimes use the notation V{G) and E{G) to denote the vertex 
and edge set respectively of a graph G. 
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2.1 Graphs 



A loop {u, u) is an edge from a vertex u to itself. In a multigraph, a pair of vertices 
u, V can have more than one edge between them, and each edge is included once 
in E. In this case, is a multiset. A graph is simple if it does not contain loops 
and is not a multigraph. 

For a vertex u G V, denote by N{u) C V the neighbour set of u, 



Denote by d{u) or du the degree of u. This is the number of ends of edges incident 
on i.e.. 



The second term in the sum shows that we count loops twice, since a loop has 
two ends incident on u. When G is simple it is seen that d{u) — \N{u)\. 

When a graph is directed, there is an in-degree and an out-degree, taking on the 
obvious definitions. 

If \V\ — n then G can be represented as a n x n matrix A = [oij], called 
the adjacency matrix. Without loss of generality, assume that the vertices are 
labelled 1 to n, then in A, ai^i — 21 where I is the number of loops from i to 
itself, and for i ^ j, a^j is the number of edges between i and j. 

A walk in a graph G is a sequence of (not necessarily distinct) vertices in G, 
{vq, Vi,V2, . . .) or (fo, f 1, V2, ■ ■ ■ , Vf) if the sequence is finite. A vertex Vi is followed 
by Vi+i only if {vi, t'i+i) & E. A walk is a path if and only if no vertex appears 
more than once in the sequence. If vq = Vt and this is the only vertex that 
repeats then then the walk is a cycle. 

If w = {vo,Vi,V2, . . . , Wt) is a walk, the length i{w) = t, of the walk is one less 
than the number of elements in the sequence. The distance between u, v is 
D{u,v) — min{£(p) : p is a path from u to v}. The diameter of G is D{G) — 




(2.1) 



d{u) ^\{e e E : e^ {u,x),x ^ u\\ + 2|{e ^E :e^ (^*)^*)}|- 



(2.2) 
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2.1 Graphs 



max{D{u,v) : u,v & V}. We may write Dq for D{G). 

A subgraph G' of a graph G, is a graph such that V{G') C V{G) and E{G') C 
We write G" C G. 

The foUowing simple lemma is very useful and common in the study of graphs. 
See, e.g., |30]. 



Lemma 1 (Handshaking Lemma). For an undirected graph G = {V,E), 
j:u^ydiu) = 2\E\. 

Note, there is no requirement that G be connected or simple. 



Proof Using equation (2.2) 



^d(M) = "^\{e e E : e = {u,x),x ^ u}\ + 2'^\{e e E : e = {u,u)}\ (2.3) 

If u,x & V and x then e = (m, x) G -E, if and only if e = (x, -u) G -E, because 
the graph is undirected (though a particular edge is only included once in E\ 
Hence, in the sum X^uey |{e G -E : e = (m, x), x 7^ u}| each edge (m, x) is included 



twice. Thus, (p| becomes Y.e={u,v)(iE 2 + 2 Ee=(«,«)ei<; 1 = 2|^|. □ 



2.1.1 Weighted Graphs 

Graphs may be weighted^ where in the context of this thesis, weights are non- 
negative real numbers assigned to edges in the graph. They can be represented 
as G = (V, -E, c) where c : — > IR+ is the weight function. We further define 

c(n) = ^(^) + E 2c(e) (2.4) 

e=(u,x) e={u,u) 
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2.1 Graphs 



and 

c(G) = J]c(n). (2.5) 
The weight of the graph, w{G), is 

w{G) = Y,c{e). (2.6) 

By the same arguments as the Handshaking Lemma [l| w{G) = c(G)/2. 
For convenience, we may also define 

c{u,v)= J2 (2-7) 

e={u,v) 
e&E, 

that is, the total weight of edges between u^v. 
Analogous definitions can be given for directed graphs. 

2.1.2 Examples 

We define some specific classes of graphs which will feature in subsequent chap- 
ters. All are simple, connected and undirected. Without loss of generality, we 
may assume that vertices are labelled [0,n — 1], where n = |V^|. 

Complete graph 

The complete graph on n vertices, denoted by Kn is the graph such that 
E = {{u^v) : u,v G V,u ^ f}, and so \E\ = (2) = '^^"^^^ . 

Path graph 

The path graph on n vertices, or n-path, denoted by P„ = (0, 1, 2, ... , n—1). 
E = {(0,l),(l,2),...,(n-2,n- 1)}. It has \E\ = \V\ - 1. 

Cycle graph 
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2.2 Markov Chains 



The cycle graph (or simply, cycle) on n vertices, denoted by Z„ = (0, 1, 2, ... , n— 
1,0). E = {(0, l),(l,2),...,(n-2,n-l),(n-l,0)}. It is same as P„, with 
an additional edge (n — 1, 0) connecting the two ends. It has \E\ = \V\. 

We say a graph G has a cycle if C G for some r. 

Trees 

A tree T is a graph satisfying any one of the following equivalent set of conditions 
(see, e.g., [HD]): T is connected and has no cycles; T has no cycles, and a cycle 
is formed if any edge is added to T; T is connected, and it is not connected 
anymore if any edge is removed from T; Any two vertices in T can be connected 
by a unique path; T is connected and has n — 1 edges. 

If, for a graph G, there is some tree T ^ G such that V{T) = V{G), then T is 
called a spanning tree of G. 

2.2 Markov Chains 

Let Q be some finite set. A Markov chain is a sequence X = (Xo,Xi, . . .) of 
random variables with Xj G having the Markov property, that is, for all t > 

Pr(Xt+i = x\Xi = xi,...,Xt = xt) = Pr{Xt+i = x \ Xt = Xt). (2.8) 

If, in addition, we have 

Pr(Xi+i = a\Xt = b) = Pr{Xt = a \ Xt-i = b) (2.9) 

for all t > 1, then the Markov chain is time-homogeneous. Such Markov chains 
can be defined in terms of the tuple Ai = {Q,P,Xq) where P = [Pi.j] is the 
X \Q\ transition matrix having entries Pjj = P[^, j] = Pr{Xt+i = j \ Xt = i). 
The first element of the sequence Xq is drawn from some distribution on Q, and 
in many applications this distribution is concentrated entirely on some known 
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2.2 Markov Chains 



starting state. 



Equations (2.8 ) and (2.9 ) together express the fact that, given knowledge of Xt-i, 
we have a probability distribution on Xt, and this distribution is independent of 
the history of the chain before Xj_i. That is, if Xt-i is known, any knowledge 
of Xs for s < t — 1 (should it exist) does not change the distribution on Xt. This 
is called the Markov property or memoryless property. 

Without loss of generality, label the n states of the Markov chain [l,n]. Let 
p(t) = [pi(t) , p2{t) , . . . , pn(t)] be the vector representing the distribution on 
states at time t. The first state Xq will be drawn from some distribution p(0), 
possibly concentrated entirely on one state. It is immediate that we have the 
relation 

n 

for any state of the Markov chain i. Alternatively, 

p(t) = p(t-l)P. 

For any s > 1, define the s-step transition probability 

=Pr(X,+,=j|Xi = z) 

and let P*-^^ = [Pif] be the corresponding transition matrix. Observe that 
p(i) = p and that 

n 

pW _ \^ p p(«-i) 



k=l 



Thus, 

and by induction on s, 



p{s) _ pp(s-l) 
p(s) _ ps 
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2.3 Random Walks on Graphs 



The above is consistent with the idea that P'-"-' = I, since this merely says that 
P(Xo = z I Xo = z) = 1. 

For a Markov chain Ai with Xq = i, define hi{j) = min{t > '■ Xt = j} and 
h^{j) = min{t > 1 : = j}. We define the following 

Hitting time from i to j 

First return time to i 

m=E[hti^)]. 

Commute time between u and v 

COM[i,j] = E[hi{j) + hj{i)] = H[z, j] + H[j, z] by linearity of expectation. 

Observe that H[i,z] = and H[z,j] > 1 for i 7^ j. Note, furthermore, that it is 
not generally the case that H[i, j] = H[j,2], although in some classes of Markov 
chains it is (examples would be random walks on the complete graph or the 
cycle, as we shall see in chapter [sj. 

The definition of hitting time can be generalised to walks starting according to 
some distribution p over the states: 

H[p,a] = J^piH[i,a]. 



2.3 Random Walks on Graphs 



A walk on a graph, as defined in section 2.1, is a sequence of vertices connected 



by edges (fo, f 1, f2, . . .). A random walk is a walk which is the outcome of some 
random process, and a simple random walk is a random walk in which the next 
edge transitioned is chosen uniformly at random from the edges incident on a 
vertex. 

Random walks on graphs are a specialisation of Markov chains. For a graph 
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2.3 Random Walks on Graphs 



G — (V,£'), the state space of the Markov chain is the set of vertices of the 
graph y, and a transition from a vertex u is made by choosing uniformly at 
random (uair) from the set of all incident edges and transitioning that edge. 
For undirected graphs, an edge can be traversed in either direction, and a loop 
counts twice. In a directed graph, the convention is that an edge is traversed in 
the direction of the arc, and so, where there is a directed loop, only one end can 
be transition. 

We give formal definitions. Let G = {V, E) be an unweighted, undirected simple 
graph. The Markov chain M.g = (Yi •) has transition matrix P^^v = l/d{u) if 
{u,v) G E otherwise Pu^v = 0. 

More generally, when G — {V,E,c) is weighted, undirected (and not necessarily 
simple) 



Observe, that by the above definitions, and the definition of c{u), a walk on 
an unweighted graph is the same (meaning, has the same distribution) as a 
walk on a uniformly weighted graph (that is, all edges have the same weight). 
Conventionally, when an unweighted graph is treated as a weighted graph, edges 
are given unit weight. 

Analogous definitions can be given for directed graphs. 

Some more notation concerning random walks: Let Wu denote a random walk 
started from a vertex u on a, graph G — (V, E). Let Pu\v) = Pr(W„(t) = v). 




(2.10) 



and 




(2.11) 



if V u. 
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2.3 Random Walks on Graphs 



For a random walk W„, let c„ = max^gy hu{v) where hu{v) was defined in section 



2.2 In addition to the quantities defined in that section (which are defined also 
for random walks on graphs, since they are a type of Markov chain), we define 
the following 

Cover time of G from u 

COV„[G] =E[c„]. 

Cover time of G 

COY[G] = max„ey COV„[G]. 
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Chapter 3 



Theory of Markov Chains and 
Random Walks 

3.1 Classification of States 

The states of a Markov chain exhibit different behaviours in generaL It is often 
the case that some of these properties can be ascertained by visual inspection of 
the graph of the chain, particularly when the graph is small. 

Much of what follows is standard material in an introduction to the topic. Aside 
from minor modifications, we quote heavily from [61] for many of the following 
definitions and lemmas. 

As before, we shall assume without loss of generality that the states of a chain 
with n states are labelled 

Definition 1 ([SI])- A. state j is accessible from a state i if Pj^^j > for some 
integer t > 0. If two states i and j are accessible from each other we say they 
communicate and we write i ^ j ■ 
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3.1 Classification of States 



In the graphical representation of a chain i ^ j if and only if there 
is a directed path from i to j and there is a directed path from j to 
i. 

-Extracted from [61J p. 164, with minor modifications. 

For random walks on undirected graphs, this is equivalent to a path existing 
between i and j. The following lemma is easy to confirm, and we omit the proof. 

Proposition 2 ([6T]). The communicating relation defines an equivalence rela- 
tion, that is, it is 

1. refiexive - for any state i, i ^ i; 

2. symmetric - ii->j^j^i; 

3. transitive - i ^ j and j k =^ i i-^ k. 

Note that a self-loop is not required for a state to be refiexive, since P/°^ = 1 by 
definition. Thus, the communication relation partitions the states into disjoint 
equivalence classes called communicating classes. The following corollary is a 
simple consequence 

Corollary 2. A chain cannot return to any communicating class it leaves. 

For random walks on undirected graphs, the communicating classes are the con- 
nected components of the graph. 

Definition 2 ([HI])- A Markov chain is irreducible if all states belong to one 
communicating class. 

Random walks on undirected graphs are therefore irreducible if and only if the 
graph is connected (i.e., a single component). More generally, a Markov chain 
is irreducible if and only if the graphical representation is strongly connected 

(ED). 
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3.1 Classification of States 



Denote by flj the probability that, starting at state i, the first time the chain 
visits state j is t; that is 

4? = Pr{Xt = j and, for 1 < s < t - 1, X, ^ j | Xq = t). 

Definition 3 ([HI])- A state i is recurrent if Ylt>i fi^i ~ ^ '^'^'^ '^^ transient 
if Xlt>i /ii^ < 1. A Markov chain is recurrent if every state in the chain is 
recurrent. 

A recurrent state is one which, if visited by the chain, will, with 
probability 1, be visited again. Thus, if a recurrent state is ever 
visited, it is visited an infinite number of times. If a state is transient, 
there is some probability that the chain will never return to it, having 
visited it. For a chain at a transient state i, the number of future 
visits is given by a geometrically distributed random variable with 
parameter p = ^t>iffi- "^^^ state in a communicating class is 
transient (respectively, recurrent) then all states in that class are 
transient (respectively, recurrent). 

-Extracted from [61], p. 164, with minor modifications. 

Recalling the definition of H[i, j] from section 

for i j and X]t>i^ ' fti ~ R-i*]- 1^ is not necessarily the case that a recurrent 
state has R[i] < oo; 

Definition 4 ([ni])- A recurrent state i is positive recurrent if H[i] < oo, 
otherwise it is null recurrent. 

An example of a Markov chain with a null recurrent state is given in [61] chapter 
7; it has an infinite number of states, in fact. 

Lemma 3 ([61]). In a finite Markov chain: 
1. At least one state is recurrent. 



2.2 



we have Y.t>i ^ ' fi j = H[i, j 
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2. All recurrent states are positive recurrent. 
The proof is left as an exercise, thus we include our own. 

Proof 1. Since there are a finite number of communicating classes, and 
since once the chain leaves a communicating class it cannot return, it must 
eventually settle into one communicating class. Thus, at least one state in this 
class is visited an unbounded number of times after the chain enters it. 

2. The communicating class of a recurrent state has no transition outside that 
class, since otherwise there would be a positive probability of no return. Consider 
a recurrent state i of an n-state Markov chain. Let C{i) denote the communi- 
cating class of i. Let p be the largest transition probability less than 1 from any 
of the states in C{i). Any walk of the chain of length i > n in C{i) that avoids 
i must include at least one transition with probability at most p. Then for any 
j eC{i),j^i, 

H[i,i] = J]Pr(/i,., > t) <nJ2p'^ = Y^<^- 

t>l m>0 ^ 

□ 

The above gives the following 

Corollary 4. For a finite Markov chain, if any state i is recurrent, then all of 
states of the communicating class of i are positive recurrent. 

We next discuss periodicity of Markov chains. As suggested by the name, peri- 
odicity is a notion of regular behaviour of Markov chains. As a simple example 
of periodic behaviour, consider a 2-state Markov chain with states {1, 2}, with 
each state having a transition to the other with probability 1. If the chain starts 
at state i e {1,2}, then it will be at state i for all even time steps (including time 
0), and it will be at the other state at all odd times. This oscillatory behaviour 
means that the distribution of the chain on states can never converge, and this 
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hints at the importance of periodicity - or lack of it. 

Definition 5 ([SI])- A state j in a Markov chain is periodic if there exists an 
integer A > 1 such that Pr(Xt+s = j \ Xf = j) = unless s is divisible by A. 
A discrete time Markov chain is periodic if any state in the chain is periodic. A 
state or chain that is not periodic is aperiodic. 

There is an equivalent definition of an aperiodic state; [M] P-40, gives the fol- 
lowing definition. 

Definition 6 ([HI])- A state i is aperiodic if Pi^i > for all sufficiently large t. 

This is followed by the following theorem, which establishes an equivalence be- 
tween the two definitions. 

Theorem 5 ([H^). A state i is aperiodic if and only if the set S = {t : P/*^ > 0} 
has no common divisor other than 1. 

In [M], the proof is left as an exercise to the reader, and so we present a proof 
below. 

Proof 

^ If S* has a common divisor A > 1, then any t > 1 that is not a multiple of A 
is not in S, and so Pj;f = in this case. 

<^ Let ti = minS". Since ti has a finite number of factors, and since for any 
S',S" C S* we have S' C S" =^ gcd(S") > gcd(5"'), we deduce that there must 
be some finite S' C S with ti E S' such that gcd(S") = 1. By the extended 
Euclidean algorithm (see, e.g. |16]) -or, in fact, Bezout's lemma - there must be 
some aj G Z such that 

aiti + 02^2 + • • • drtr = 1, (3.1) 

where the tj are members of S'. 
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Now Qj = a'j (mod ti) for some < a'- < ti, so substituting a'- for aj in equation 



(3.1) and taking it modulo ti we have 

i = 02^2 + • • • a'rtj- = 1 (mod ti). 

If t = s (mod ti) then t — si = (mod ti). Furthermore, if t > (ti — 1)£, then 
t = mti + s£ for some non-negative integer m. Since any positive sum of elements 
in 5* is an element in S*, it follows that P/*'' > . □ 

Finally for this section, we define and discuss ergodicity: 



Definition 7 ([HI])- An aperiodic, positive recurrent state is an ergodic state. 
A Markov chain is ergodic if all of it's states are ergodic. 

As a corollary to the above theorems, we have 

Corollary 6 ([HI])- Any finite, irreducible, and aperiodic Markov chain is an 
ergodic chain. 

Proof A finite chain has at least one recurrent state by Lemma [3] and if 
the chain is irreducible, then all of its states are recurrent. In a finite chain, 
all recurrent states are positive recurrent by Lemma [3] and thus all states of the 
chain are positive recurrent and aperiodic. The chain is therefore ergodic. □ 

The significance of ergodicity is made clear in the following section. 



3.2 The Stationary Distribution 

Recall that the distribution on the states evolves with this relation 

p{t + s) = p{t)P'. 
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3.2 The Stationary Distribution 



A fundamental question is when, if ever, there exists a distribution that remains 
invariant under the operation of post-multiphcation by the transition matrix. 

Definition 8 ([61]). A stationary distribution (also called an equihbrium dis- 
tributionj of a Markov chain is a probability distribution vr such that 

TT = vrP. 

A chain in the stationary distribution will continue to be so after subsequent 
transitions. 

We now quote an important theorem from [ni], but omit the proof, which al- 
though not difficult, is fairly lengthy. 

Theorem 7 ([61j). Any finite, irreducible, and ergodic Markov chain has the 
following properties: 

1. the chain has a unique stationary distribution vr = (tti, 7r2, . . . vr^); 

2. for all i and j , the limit limt^oo Pij exists and is independent of i; 

3. TT,- = limi^oo = VRbl 

There are a number of other proofs of Theorem [7j [M] gives a proof based on 
coupling; [57] gives a different different proof also based on couphng; [59] gives 
a treatment specialised for connected undirected graphs using the framework 
of linear algebra, in particular, the eigenvalues of the transition matrix and 
the Perron-Frobenius theorem to show the existence and convergence to the 
stationary distribution. 

For a random walk on an undirected graph that is finite, connected and not 
bipartite, the conditions of ergodicity and thus the conditions for Theorem [7] are 
satisfied. 
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3.3 Random Walks on Undirected Graphs 

For a random walk on an undirected graph a pair of vertices u, v communicate if 
there is a path between them. Furthermore, as stated above, the communicating 
classes are the connected components of the graph and so the random walk is 
irreducible if and only if the graph is a single connected component. For aperi- 
odicity, we quote the following lemma from [61], along with the accompanying 
proof. 

Lemma 8 ([61]). A random walk on an indirected graph G is aperiodic if and 
only if G is not bipartite. 

Proof A graph is bipartite if and only if it does not have cycles with an odd 
number of edges. In an undirected graph, there is always a path of length 2 from 
a vertex to itself. If the graph is bipartite, then the random walk is periodic 
with period 2. If the graph is not bipartite, then it has an odd cycle and by 
traversing the cycle, an odd-length path can be constructed from any vertex to 
itself. It follows that the Markov chain is aperiodic. □ 

Thus, by Corollary [6] and Lemma |8] it is seen that 

Corollary 9. A random walk on an undirected graph that is finite, connected 
and non-bipartite is ergodic. 

The existence of the stationary distribution and the convergence of the walk to 
it is thus established. 

Theorem 10 ([Hlj). A random walk on an undirected graph G = (y,E) that 
is finite, connected, and not bipartite converges to the stationary distribution vr 
where, for any vertex v E V 

d{y) 
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3.4 Time Reversal and a Characterisation of Random Walks on Undirected 

Graphs 

Proof By the handshaking lemma, Ylvev '^(^) ~ ^l-^l- Thus, it follows that 

and so vr is a proper probability distribution over V. Let P be the transition 
matrix of the walk on G and let N{v) represent the neighbour set of v. The 
relation vr = ttP is equivalent to 

Ed{u) 1 d{v) 
2\E\d(JI) ^ 2\E\ 

and the theorem follows. □ 

The above proof is for simple graphs. It can be generalised for graphs with 
loops and/or parallel edges by with stationary probability ^^ where now d(v) 



simple) weighted graphs with stationary probability vr^ = ^j^. See section 



is given by (2.2). The theorem can be further generalised to (not necessarily 

c(G) 



2.3 



for relevant definitions. 



3.4 Time Reversal and a Characterisation of 
Random Walks on Undirected Graphs 

This section discusses a characterisation of Markov chains that precisely cap- 
tures random walks on undirected graphs, including non-simple graphs (those 
with loops and/or parallel edges) as well as weighted undirected graphs. To do 
so we introduce the following definition, taken from [M] 

Definition 9. Let {Xt)o<t<T be a (suh)sequence of states of a Markov chain 
M. = (^7,P,7r) where vr is the stationary distribution of Ai. The time reversal 
of {Xt)o<t<T is the sequence {Yt)o<t<T where Yt = Xt-i- 



32 



3.4 Time Reversal and a Characterisation of Random Walks on Undirected 

Graphs 



The following theorem is given in [6l], and we omit the proof. 

Theorem 11. Let (Xt)o<t<T be a (suh)sequence of states of a Markov chain 
M. = {Q,P,7t) where n is the stationary distribution of Ai. Then {Yt)Q<t<T is 
a (sub)sequence of states of a Markov chain M. = (f2,P,7r) where P = [Pjj] is 
given by 

TTjPj^i = T^iPij for all i,j (3.2) 
and P is also irreducible with stationary distribution n. 
This leads us to the following definition 

Definition 10. //P = P then the Markov chain Ai is said to be time reversible. 



If a Markov chain is reversible, then (3.2) becomes 



HjPj^i = HiPij for alH, j (3.3) 
known as the detailed balance condition. 

Conversely, if the detailed balance condition is satisfied for some distribution p, 
that is 

PjPj^i=PiPij foralH,j 
then p is the stationary distribution, which, along with irreducibility and The- 



orem 1 1 , implies the following 



Corollary 12. An irreducible Markov chain with a stationary distribution it is 



reversible if and only if it satisfies the detailed balance condition (3.3). 
The next theorem characterises Markov chains as random walks. 

Theorem 13. Random walks on undirected weighted graphs are equivalent to 
time reversible Markov chains. That is, every random walk on a weighted graph 
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3.4 Time Reversal and a Characterisation of Random Walks on Undirected 

Graphs 



is a time reversible Markov chain, and every time reversible Markov chain is a 
random walk on some weighted graph. 

Proof The transition matrix for a random walk on a weighted undirected 
graph G = {V,E) defines 

_ c{u,v) 
ciu) 



where c{u,v) is defined by (2.7) (and therefore valid for non-simple graphs). 
Thus 

c{u) c{u,v) c{u,v) c{y,u) c{v) c{v,u) 
" c{G) ciu) ^ ~4GY ^ ~4GY ^ c{G) c{v) ^ '"'^ 

Hence, the detailed balance condition is satisfied and by Corollary[l2| the random 
walk is a reversible Markov chain. 

Consider some reversible Markov chain M = (f2,P,.) with transition matrix 
P and where, as before, we assume the states are labelled [l,r;,]. We define a 
weighted graph G = {V,E) based on M as follows: Let V = Q, let {i,j) € 
E if and only if Pjj > and weight the edge as c{i,j) = TTiPij. By 

reversibility, the detailed balance equations imply c{i,j) = c{j,i), hence weights 
are consistent and the weighted graph is proper. The random walk on the graph, 
by construction, has transition matrix P. □ 
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Chapter 4 



The Electrical Network 
Metaphor 



In this chapter we give an introduction to the electrical network metaphor of 
random walks on graphs and present some of the concepts and results from the 
literature that are used in subsequent parts of this thesis. Although a purely 
mathematical construction, the metaphor of electrical networks facilitates the 
expression of certain properties and behaviours of random walks on networks, 
and provides a language for which to describe these properties and behaviours. 
The classical treatment of the topic is |33]. The recent book [57J presents material 
within the more general context of Markov chains. Other treatments of the topic 
are found in [3] and [55] . 

We first present some definitions. 
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4.1 Electrical Networks: Definitions 

An electrical network is a connected, undirected, finite, graph G = {V, E) where 
each edge e E E is has a non-negative weight c(e). The weight is called the 
conductance. The resistance of an edge e, r(e) is defined as the reciprocal of 
the conductance, l/c(e) if c(e) is finite, and is defined as oo if c(e) = 0. It is 
quite often the case in the literature that in the context of electrical networks, 
the vertices of the network are referred to as nodes. We shall use the terms 
'graph' and 'network', and 'vertex' and 'node', interchangeably in the context of 
electrical network metaphor. 

A random walk on an electrical network is a standard random walk on a weighted 



graph, as per the definition of section 2.3 A random walk on an electrical 



network are therefore equivalent to time reversible Markov chains by Theorem 

Since edges are always weighted in the context of electrical networks, we shall 
use the notion G = (V, E, c) for the network, where the third element of the 
tuple is the weighting function on the edges. 



4.2 Harmonic Functions 

Given a network G = (V, E,c), a function / : — > M is harmonic at u G if it 
satisfies 

f{u) = J2Pu,Jiv), (4.1) 



where Pu,v is defined by equations (2.10) and (2.11) 



For some set Vb C V, called boundary nodes, call the complement Vj = V \ Vb 
internal nodes. 
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4.2 Harmonic Functions 



Lemma 14. For a function fvg : — ?■ M, any extension of fvg toV, f : V 
that is harmonic on the internal nodes Vj attains it's minimum and maximum 
values on the boundary. That is, there are some b, B E Vb such that for any 
veV, f{b)<f{v)<f{B). 

Proof We start with the upper bound. Let M = max^^v fi^) and let 
Vm = {x e V : f{x) = M}. If Vb H Vm 7^ then we are done. If not, then 
Vm ^ Vj and we choose some x G Vm- Since / is harmonic on Vj, f{x) is a 
weighted average of its neighbours. It follows that f{y) = M for each neighbour 
y of X, i.e., y G Vm- Iterating this repeatedly over neighbours, we see that any 
path in the network must have the property that each 



Xi G Vm- Since a network is connected by definition (see 4.1), there must be a 
path from x to some 2; G Vb? in which case we get a contradiction, and therefore 
deduce that Vb H Vm 7^ 0. A similar argument holds for the minimum. □ 

Theorem 15. For a function fvg : Vb — ?■ M the extension of fvg to V , f : V ^ 

M is unique if f is harmonic on all the internal nodes V/. 

Proof Suppose there are functions /, g which extend fvg and are harmonic 
on each node in Vj. Consider h = f — g. This function has h{v) = for any 



V G Vb, and is harmonic on Vj. It follows by Lemma 14 that h{v) = for any 



f G V/ as well. Therefore g = h. □ 

The problem of extending a function fvg to a function / harmonic on Vj is 
known as the Dirichlet problem, in particular, the discrete Dirichlet problem, (in 
contrast to the continuous analogue). For electrical networks (and in fact more 
generally, for irreducible Markov chains), a solution to the Dirichlet problem 
always exists, as provided by the following function. 

Theorem 16. Let G = (V, E, c) be a network, Vb G V , be a set of boundary 
nodes and Vj = V \ Vb be the internal nodes. Let fy^ : — > M 6e a function 
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on the boundary nodes. Let X(W) he a random variable that represents the first 
V & Vb visited by a weighted random walk W on G = {V, E, c) started at som,e 
time. The function f{v) — E[/vg(X(W^))], where v & V, extends fvg and is 
harmonic on Vj. 

Proof For a node v eVb, 

f{v) = E[/y,(X(W„))] = nfvM] = fvsiv). 

thus, / is consistent with fvg on the boundary nodes. 
For u e Vi, 

f{u) = E[/v,(X(>V„))] 

= J2nfvAx{yv.))]Pu,v 

vev 

This proves harmonicity on l^. □ 

4.3 Voltages and Current Flows 

Consider a network G — (V, E, c) and let a pair of nodes a and z be known as the 
source and sink respectively. Treating a, 2; as the only elements of a boundary 
set on the network, a function W harmonic on y \ {a, z} is known as a voltage. 

For an edge e = {u, v), denote by = {u, i) = {h, u) an orientation of the edge 
from u to V. Furthermore, if = (m, l) then let V = ('^, v) = {v, A flow 
(/? : ^ ^ M where ^ = : e e i?} U {V : e e is a function on oriented 
edges which is antisymmetric, meaning that '^{^) = — ^^(V). For a flow </?, 
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define the divergence of at a node u to be 

Observe, for a flow y?, 

5^divv.H = 5^ ^(^) = 5^^(^) + y.(V) = 0. (4.2) 

u&v «ey-^=(^)g:^ eeB 

We term as a flow from a to z a. flow ip satisfying 

1. Kirchhoff's node law 

div Lp{v) = for all v ^ {a, z}, 

2. div ip{a) > 0. 

The strength of a flow (f from a to 2; is defined to be \\ip\\ = div (f{a) and a unit 



flow from a to 2; is a flow from a to z with strength 1. Observe also that by (4.2) 
divy9(a) = — div {p{z). 

Given a voltage W on the network, the current flow I associated with W is 
defined on oriented edges ~^ = {u, h) by the following relation, known as Ohm's 
Law: 

iCt) = ^^^^ ~ ^^^^ = c{e){W{u) - W{v)) (4.3) 
r[e) 

Conductances (resistances) are defined for an edge with no regard to orientation, 
so in (4.3) we have used for notational convenience that c(e) = c{~^) = c(V). 



We will continue to use this. 

Let G be a network and for some chosen boundary points Vb C V let f : V 
harmonic on the internal nodes Vi = V \ Vb- For a transformation of the form 
T(x) = ax + (3, applying T to / on the boundary points we get a new set of 
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boundary node values given by T{f{v)) for any v G Vb- At the same time, T(/) 
is a solution to the Dirichlet problem for the new boundary node values, and so 



by Theorem 15 , it is the only solution. 



Now let Iw be the current flow by a voltage W on G (with some chosen source 
and sink) and It{w) the current flow from the transformation T applied to W. 
It can be seen from the definition of current flow that It{w) = o. ■ Iw- In 
particular, this means that current flow is invariant with respect to /3. Thus, 
assuming constant edge conductances, current flow is determined entirely by 
= W{a) — W{z). We may therefore denote the current flow determined by 
Aw by I Aw- Observe Iq = and la- Aw = ' ^Aw- Thus if, for a given G, any 
finite, non-zero current flow exists, then ||/awII as a function of Aw is a bijective 
mapping from M to M. In particular, if any finite, non-zero current flow exists, 
then a unit current flow exists and is unique. 

We show that / is a flow from a to z when W{a) > W{z). Firstly, consider any 
node u ^ {a, z}: 



•,= {u,v) 
e&E 

J2 cie)iWiu)-Wiv)) 



~^={u^) e=(u,v) 



e={u,v) 



w{u) J2 c(^) - c(^) E ^^(^') 

e={u,v) e={u,v) 
eeE e£E 

W{u)c{u) -c{u)W{u) = 



The last line follows because 



E ^^^^ = ^^^^ 



e={u,v) 
e£E 
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by definition and 



c(u) 

e={u,v) ^ ' 



by harmonicity. 



Now if W[a) > W{z) then by Lemma 14, W{a) > W{v) for any v ^V. There- 
fore, 

div/(a)= c{e){W{a) -W{v))>{). 

e={a,v) 

Thus, having proved both conditions of the definition, it is shown that the current 
flow is a flow from a to z. Furthermore, since setting A^r = W{a) — W{z) = 1 
will give some current flow Ji, setting W{a) — W{z) = l/||/i|| will give a unit 
current flow. 

The significance of the unit current flow will become clear in the discussion of 
effective resistance. 



4.4 Effective Resistance 

We begin with the definition 

Definition 11. Let G = {V,E,c) be a network, and let a,z be a pair of nodes 
in the network. Let W be any voltage with a, z treated as source and sink respec- 



tively and with W{a) > W{z). Using the notation of Section 4-3. the effective 
resistance between a and z, denoted by R{a, z) is defined as 

K[a, z) 



Clearly, for this definition to be proper, the ratio has to be invariant with respect 
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to voltages, and indeed it was shown in section 4.3 that la-d^w ~ ^ ' ^^w^ t\ms 
preserving the ratio. 

It is important to note that the resistance r{u,v) of an edge {u,v) is different 
to the effective resistance R{u,v) between the vertices u,v. Resistance r{u,v) is 
l/c{u,v), the inverse of conductance, which is part of the definition of the net- 
work G = (V, E, c), and is the weighting function c defined on an edge. Effective 
resistance, on the other hand, is a property of the network, but not exphcitly 
given in the tuple (V, E, c), and it is defined between a pair of vertices. 

Theorem 17 ([33] or | |57] ). Effective resistance forms a metric on the nodes of 
a network G = (V, E, c), that is, (1) R{v, v) = for any v ^ V (2) R{u, v) >Q 
for any vertices u,v E V (3) R{u,v) = R{v,u) for any vertices u,v E V (4) 
R{u, w) < R{u, v) + R{v, w) for any vertices u,v,w E V ( triangle inequality^. 

Define the energy S{(p) of a flow ^9 on a network G = {V, E, c) as 

m = Y.l^ie)]Me) (4.4) 



Note, the sum in (4.4) is over unoriented edges, so each edge e is considered only 
once. Because flow is antisymmetric by definition, the term [v5(e)]^ is unambigu- 
ous. 

The following theorem is useful in using current flows to approximate effective 



resistance. We shall see such an application in section |6.8[ For a proof, see, for 
example [57] . 

Theorem 18 (Thomson's Principle). For any network G = {V, E, c) and any 

pair of vertices u,v E V, 

R{u, v) = mm{S{(p) : ip is a unit flow from u to v}. (4-5) 
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The unit current flow is the unique tp that gives the minimum element of the 
above set. 



4.4.1 Rayleigh's Monotonicity Law, Cutting &; Shorting 

Rayleigh's Monotonicity Law, as well as the related Cutting and Shorting Laws, 
are intuitive principles that play important roles in our work. They are very 
useful means of making statements about bounds on effective resistance in a 
network when the network is somehow altered. With minor alterations of nota- 
tion, we quote [ST] Theorem 9.12, including proof. 

Theorem 19 (Rayleigh's Monotonicity Law). If G = {V, E) is a network 
and c, c' are two different weightings of the network such that r(e) < r'(e) for all 
e E E, (recall, r(e) = l/c{e)), then for any u,v gV, 

R{u,v) < R'{u,v) 

where R{u, v) is the effective resistance between u and v under the weighting c 
(or r), and R'{u,v) under weighting c' (or r'). 

Proof Note that min^ ^gg^[(y9(e)]^r(e) < miu;^ ^ggg[(y9(e)]^r'(e) and apply 



Thomson's Principle (Theorem 18). □ 



Lemma 20 (Cutting Law). Removing an edge e from a network cannot de- 
crease the effective resistance between any vertices in the network. 

Proof Replace e with an edge of infinite resistance (zero conductance) and 
apply Rayleigh's Monotonicity Law. □ 

Lemma 21 (Shorting Law). To short a pair of vertices u,v in a network 
G, replace u and v with a single vertex w and do the following with the edges: 
Replace each edge {u,x) or {v,x) where x ^ {u^v} with an edge {w,x). Replace 
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each edge {u,v) with a loop {w,w). Replace each loop {u,u) or {v,v) with a 
loop {w,w). A new edge has the same conductance as the edge it replaced. Let 
G' denote the network after this operation, and let R and R! represent effective 
resistance in G and G' respectively. Then, for a pair of vertices a, z, ^ {n, v}, 
R'{a, z) < R{a, z), R'{a, w) < R{a, u) and R'{a, w) < R{a, v). 

Proof Consider nodes a,z ^ {u, v} of tlie network G = {V, E, c), and let G' 
denote G after a Sliorting operation on u, v. For any flow ip from a to z in G, we 
can define a flow ip' in G' as follows: For any e = {x,y) such that x,y ^ {u,v}, 
ip'C^) = fC^)- For e = {u,u) or e = {v,v), let e' = {w,w) be the edge that 
replaced e, and have v^'(e^) = V^("^) (orientating the loops arbitrarily). For 
e = {u,v) let e' = {w,w) be the edge that replaced e, and have (f'C^) = ^C^) 
(again, orientating the loops arbitrarily). It is easily seen that ip' is a valid unit 
flow from a to z, and that the energy on each edge is the same for both ip and 
(f'. It follows that S{(f') = £{'~p). Hence by Thomson's Principle, ii ip = ipmm, 
the unit current flow from a to z in G, then R'{a,z) < S{ip') = S{{p) = R{a,z) 
A similar argument can be made for R'{a,w) < R{a,u) and R'{a,w) < R{a,v). 

□ 

Sometimes the Shorting Law is defined as putting a zero-resistance edge between 
u, V, but since zero-resistance (infinite-conductance) edges are not defined in our 
presentation, we refer to the act of "putting a zero-resistance edge" between a 
pair of vertices as a metaphor for shorting as defined above. 

4.4.2 Commute Time Identity 

The following theorem, first given in [TH] is a fundamental tool in our analysis 
of random walks on graphs, in chapter [6] The proof is not difficult but we omit 
it because a presentation would be lengthy. We refer to [18] or [57] for a proof. 

Theorem 22 ([H]). Let G = {V, E, c) be a network. Then for a pair of vertices 
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u,v 

COM[u,v] = c{G)R{u,v). 
(The reader is reminded that c{G) = c(f ) = 2^gg^c(e), as defined in 



4.5 Parallel and Series Laws 



The parallel and series laws are rules that establish equivalences between certain 
structures in a network. They are useful for reducing a network G to a different 
form G", where the latter may be more convenient to analyse. We quote from 
j57j . with minor modifications for consistency in notation. 

Lemma 23 (Parallel Law). Conductances in parallel add. 

Suppose edges ei and 62, with conductances c(ei) and 0(62) respectively, share 
vertices u and v as endpoints. Then ei and 62 can be replaced with a single edge 
e with c(e) = c(ei) + 0(62), without affecting the rest of the network. All voltages 
and currents in G\{ei, 62} are unchanged and the current /("^) = /(ei) + /(e2). 
For a proof, check Ohm's and Kirchhoff's laws with /("^) = /(et) + I{e^)- 

Lemma 24 (Series Law). Resistances in series add. 

\i V & V \ {a, z}, where a and z are source and sink, is a node of degree 2 with 
neighbours fi and f2, the edges (fi,f) and (f,f2) can be replaced with a single 
edge (fi,f2) with resistance r{yi,V2) = r{vi,v) + r{v,V2). AH potentials and 
currents in G \ {t;} remain the same and the current that flows from Vi to V2 
is /(f 1, ^2) = /(f 1, i) = /(f , ^2). For a proof, check Ohm's Law and Kirchhoff's 
Law with I{vi, vl) = I{vi,i) = I{v, vl). 
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Chapter 5 

Techniques and Results for 
Hitting and Cover Times 



In this chapter we present some of the techniques for bounding hitting and cover 



times, as well as particular results. We start with section |5.1| where we give cal- 
culations for hitting and cover times of some particular graph structures. The 
graph classes that are the subject of the next section are important for subse- 
quent chapters, and as examples, they serve to convey some of the techniques 
used to precisely calculate hitting and cover time. This allows comparisons to be 
drawn with more general techniques and bounds. In general, it is quite difficult 
to calculate precise cover times for all but a few classes of graphs; the exam- 



ples given in section 5.1 are amongst the simplest and most common structures 



studied in the literature. 



We refer the reader to section 2.1.2 for reminders on definitions of graph struc- 



tures and section 2.3 for a reminder of relevant notations and definitions relating 



to random walks on graphs. 

We only deal with connected, undirected graphs in this chapter. 

The n'th harmonic number, h{n) = ^27=1 T ^ recurring quantity, and a short 
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hand proves useful. Note h{n) = Inn + 7 + 0{l/n) where 7 ~ 0.577 (see, e.g. 
|19j). Thus even for relatively small values of n, Inn is a close approximation 
for hn- 

5.1 Precise Calculations for Particular Struc- 
tures 

We deal with three particular classes of graphs: complete graphs, paths and 
cycle. These results are given in [59], amongst others. 

5.1.1 Complete Graph 

Theorem 25. Kn = {V, E) be the complete graph on n vertices. 

(i) R[n] = n for any u eV . 

(ii) H['U,f] = n — 1 ifu^v, for any pair of vertices u,v eV. 

(iii) COY[Kn] = {n- l)h{n - 1). 

Proof (i) \E\ = and d{u) = n — 1 for any u E V. Now use Theorem 

[To] and Theorem [7l 

(ii) The walk has transition probability Pu^v = ^rrf ii u v and Pu,u = 0. Thus, 
H[M,f] is the expectation of a geometric random variable with parameter 

i.e., H[m, v] = n — 1. 

(iii) Let T(r) be the expected number of steps until r distinct vertices have been 
visited by the walk for the first time. By the symmetry of the graph, T{r) will 
be invariant with respect to starting vertex. Suppose the walk starts at some 
vertex u E V. Then T(l) = 0. Since it will move to a new vertex in the next 
step, T(2) — T(l) = 1. Suppose it visits the r'th distinct vertex at some time t, 
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where r < n. Then there are n — r unvisited vertices, and the time to visit any 
vertex from this set is a geometric random variable with probability of success 
the expectation of which is = T(r + 1) — T{r). Thus by linearity of 
expectation, 

COV[;^„] = (TH-T(n-l)) + (T(n-l)-T(n-2)) + ... + (T(2)-T(l))+T(l) 

r=l 

□ 

Thus, COY[Kn] -nlnn. 



5.1.2 Path 



Without loss of generality, we label the vertices of the n- vertex path graph, P„ 
by the set [0,72 — 1], where labels are given in order starting from one end, i.e., 
P„ = (0,l,2,...,r2-1). 



Theorem 26. Let Pn be the path graph on n vertices. 

(i) R[0] = R[n - 1] = 2{n - 1) and R[-i] = n - 1 for < i < n - 1. 

(ii) When i < j, = — i"^. In particular, H[0,?2 — 1] = {n — 1)^. 

I ^ ^ ' ^ ij n even 

Proof (i) d{0) = d{n — 1) = 1, and d{i) = 2 when < i < n — 1. Further- 
more, \E\ = n — I. Now use Theorem 10 and Theorem [7j 

(ii) By part (i), R[n — 1] = 2{n — 1), but we also know that H[n — 1] = 
1 + H[n — 2,n — 1], because when the walk is on vertex n — 1 , it has no choice but 
to move to vertex n — 2 in the next step. So we have li[n — 2,n — l] = 2(n — 1) — 1. 
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Furthermore, H[r — 1, r] = 2r — 1 for any < r < n — 1, since this is the same 
as H[r — l,r] on P^+i- We have, by linearity of expectation. 



H[t,j] = ^ H[r - l,r] = ^ 2r - 1 = (j + z + l)(j - t) - (j - i) = j 



r=i+l r=i+l 



(iii) Our analysis will be more notationally convenient with n + 1 vertices than 
with n vertices. Let denote the expected time to reach either one of the ends, 
when starting on some vertex r G [0,n]. The variables a^. satisfy the following 
system of equations: 

if r = 

flr = < 1 + ^flr-i + ^c^r+i if < r < n (5.1) 
if r = n 

A solution is = r{n — r): Oq = 0(n — 0) = = n{n — n) = an, and 
1 1 

— ^(j^ _ _ \ _ _ — (7- — 1)) -|_ (7- -|- \){n — (r + 1))] 
= 0. 

Furthermore, this is the only solution. This can be seen by studying the system 
of equations written as a matrix equation, and determining that the equations 
are linearly independent, but a more elegant method relies on the principle that 
harmonic functions achieve their maximum and minimum on the boundary, as 



expressed in Lemma 14: Suppose that there is some other solution a'^ set the 



system of equations (5.1). Consider fr = ar — a'^. We have /o = oo — ci'o = 0, 
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and fn = (in — 0'n = 0- Furthermore, for < r < n, 

fr (^r 

= 1 + -ar-l + -ar+1 - ( 1 + 2^r-l + lfr+\ 

Thus, fr is harmonic on [l,n — 1]. Since = on the boundary vertices 



by Lemma 14 fr = for r G [1, n — 1], and so a' = a. 



To cover P„+i when starting at vertex r G [0, n], the walk needs to reach either 
one of the ends, then make its way to the other. Thus 

COV^[P„+i] = ar + H[0, n] = r{n - r) + n^. 

When n is even (i.e, the path P„+i has an odd number of vertices), then 
COV,,.[P„_|_i] is maximised at r = |, in which case, 

COV,[P„+i] = COV[P„+i] = — . 

When n is odd, COV.r[P„+i] is maximised at r = [|J or r = [l] , in which 
case. 



COV,[P„+i] =COV[P„+i] 





(^n — 




2 


2 


/ n 

u 


-0 


f n 


5n^ 


1 






~ 4' 





□ 
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5.1.3 Cycle 

Theorem 27. Let Z„ be the cycle graph on n vertices. 

(i) Il[u] = n for any vertex u. 

(ii) For a pair of vertices u, v distance r < n/2 from each other onZn, ii[u,v] = 

r{n — r), 

(iii) COV[Z„] = 

Proof (i) The vertices all have the same degree, so by symmetry, and in 
conjunction with Theorem [10} vr^ must be the same for all m G V^, that is, 
TXu = 1/n. Now apply Theorem [7| 



(ii) We use the principles of the proof of Theorem 26 We assume the vertices 
of Z„ are labelled with [0,n — 1] in order around the cycle. Hence, vertex 
0, for example, would have vertices 1 and n — 1 as neighbours. We wish to 
calculate H[r, 0]. Observe that there are two paths from r to 0; one path is 
(r, r — 1, . . . , 0). On this path the distance between vertices r and is r. The 
other path is (r, r + 1, . . . , n — 1, 0). The distance between r and on this path 



is (n — r). By the same principles as the proof of Theorem 26, we calculate 
H[r, 0] by equating it with the expected time it takes a walk to reach or of 
a path graph (0, 1, 2, . . . , n), when the walk starts at vertex r. As calculated for 



Theorem 26, this is r(n — r). 



(iii) To determine the cover time, observe that at any point during the walk, 
the set of vertices that have been visited will be contiguous on the cycle; there 
will be an "arc" (path) of visited vertices, and another of unvisited. Let T(r) be 
expected time it takes a walk starting at some vertex to visit r vertices of Z„. 
Given that the walk does indeed start on some vertex, we have T(l) = 0. After 
it moves for the first time, it visits a new vertex, thus giving T(2) — T(l) = 1. 
By linearity of expectation. 



51 



5.2 General Bounds and Methods 



COV[Z„] = (T{n)-T{n-l)) + {T{n-l)-T{n-2)) + ... + {T{2)-T{l)) + T{l). 

(5.2) 

Suppose the walk has just visited the r'th new vertex, where r < n. Without 
loss of generality, we can label that vertex r, and further label the arc of already 
visited neighbours (r — 1, r — 2, . . . , 1), in order. The arc of unvisited vertices 
is labelled (r + 1, r + 2, . . . , n — 1, 0), such that (r, r + 1) and (0, 1) are edges 
on the cycle. Thus, the next time the walk visits a new vertex, it will be either 
the vertex labelled r + 1 or the vertex labelled in the current labelling. Hence, 
T(r + 1) — T{r) is the same as the expected time it takes a walk on a path graph 
(0, 1, 2, . . . , r + 1) to reach or r + 1, when it starts on vertex r. As calculated 



above, this is r(r + 1 — r) = r. Equation (5.2) can thus be calculated as 



n-l 



r=0 

□ 



5.2 General Bounds and Methods 

In this section, we detail two general approaches for bounding cover times: the 
spanning tree technique, and the Matthews 's technique. Despite the simplicity of 
the techniques, they can often yield bounds that are within constant factors of 
the actual cover time. Both methods can be applied to a graph under question, 
but it is often the case that one is more suited, i.e, yields tighter bounds - than 
the other for a particular graph. In both cases, the effectiveness of the technique 
is dependent on finding suitable bounds on hitting times between vertices (or 
sets of vertices), as well as the way in which the technique is applied. 
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5.2.1 Upper Bound: Spanning Tree and First Return 
Time 

Let G = (V, E) be an undirected, unweighted, simple, connected graph. Let 
n = |\^| and m = \E\. One way to upper bound the cover time of G is to choose 
some sequence of vertices a = {vo,vi, . . . ,Vr) such that every vertex in V is in 
a, and sum the hitting time from one vertex to another in the sequence; that is, 

COV[G] <H[vo, vi] +H[vi,V2] + ... + H[?;,_i, 



Proposition 3. For a tree T = (V, E) we can generate a walk (sequence of edge 
transitions on T) a = (fo,fi, . . . ,f2|y|-2) such that each edge of T is traversed 
once in each direction. 

The sequence a of Proposition |3] contains every vertex of V. It can, in fact, 
be generated be the depth first search (DPS) algorithm started at some vertex 
V = Vq. We shall use DPS again in chapter |6} See, e.g., [SS] for a discussion of 
the algorithm. 

The following theorem and proof are given given in [61]. The argument itself 
goes back to [S]. 

Theorem 28. Let G = (V, E) he an undirected, unweighted, simple, connected 
graph, and let n = \V\ and m = \E\. 

COV[G] < Amn. 

Proof 
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For any v E V, using Theorem 10 and Theorem [7| we have 

2m 



RM 



d{v)' 



But we also know that 



where N{v) is the neighbour set of v. Thus, equating (5.3) and (5.4) 

2m 



d{v) d{v) 



u£N{v) 



and so H[M,t>] < 2m. 



(5.3) 



(5.4) 



Let T = (VjEt) be some spanning tree of G. Let a = {vo,vi, . . . ,V2n-2) be a 
walk as described in Proposition |3| Since each vertex of V occurs in a, we have 



2n-3 



COV[G] < U[vi, v^+l] < 2m{2n - 2) < 4mn. 



i=0 



□ 



5.2.2 Upper Bound: Minimum Effective Resistance Span- 
ning Tree 



In section 5.2.1 we computed an upper bound on the sum of commute times 
of a spanning tree of G. We can generalise this to trees that span G, that is, 
include all the vertices of G, but who's edges are not necessarily contained in G. 

Let G be a connected, undirected graph. If G is unweighted, assign unit weights 
(conductances) to the edges of G. Thus, G = (V, E, c). 
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Definition 12. Define the complete graph K = {V,E',p) where the weighting 
function p : E' IR+, and p{{u,v)) = R{u,v) where R{u,v) is the effective 
resistance between u and v in G. Let T = {T is a spanning tree of K}, and let 
eT he such that w{T^) < w{T) for any T G T (recall w{T) is the total of the 



edge weights ofT, as defined by equation 2.6). We call the minimum effective 
resistance spanning tree of G. 

Theorem 29. LetG he a connected, undirected graph. If G is unweighted, assign 
unit weights (conductances) to the edges of G. Thus, G = {V,E,c). 

COY[G] < c(G)w(T,), 

where is the minimum effective resistance spanning tree of G. 



Proof Using Theorem 22, we have for any u,v E V 



So 



COM[u,v] = c{G)R{u,v). 



iG)wiT,) = ciG) J2 Riu,v)= J2 COM[u,v]. (5.5) 

{u,v)&E' {u,v)&E' 



Now we continue using the same ideas of the proof of Theorem |28] we apply 
Proposition [3] to generate a sequence a that transition each edge of T^, once in 
each direction, thereby visiting every vertex of G. The RHS of the second equal- 



ity of (5.5), X](Mt))e£;' COM[m, v], is the sum of hitting times for the sequence a. 

□ 

For all but a few simple examples, it can be difficult to determine w(T^,). How- 
ever, bounds on effective resistances can often be determined using the various 
tools of electrical network theory; for example, through the use of flows and 
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Thomson's principle (Theorem 18), and other tools such as Rayleigh's laws, 
cutting and shorting laws, etc. 



5.2.3 Upper Bound: Matthews' Technique 

Theorem 30 (Matthews' upper bound, [62]). For a graph G = {V,E), 

COy[G]<H*[G]h{n), (5.6) 

where ii*[G] = maxu^vev v] and h{n) is the n'th harmonic number Y17=i n' 

We refer the reader to, e.g., [57] for a proof. The proof is not difficult, but is 
fairly lengthy. 

The power of the method is two-fold. Firstly, one needs only to bound H*[G'], 
which can be facilitated through electrical network theory as well as consider- 
ation of the structure of G. Secondly, it applies also to weighted graphs (note 
there is no restriction of being unweighted in the statement of the theorem). 

To use electrical network theory, we can use the commute time identity of The- 
orem 22, and use the commute time as an upper bound for hitting time. In 



this case, (5.6) is expressible as COV[G] < c{G)R*{G)hn, where R*{G) is the 



maximum effective resistance between any pair of vertices in G. 

Despite the simplicity of the inequality, the method can yield bounds on cover 
time that are within a constant factor of the precise value. This is always the 
case if !!*[(?] can be shown to be 0{n), since this gives a cover time of 0{n log n), 
and as we shall see in section 5.3.1[ cover times of graphs are f2(nlogn). 



One example of an application of Theorem 30 that gives good bounds is on the 



complete graph. As was established in Theorem 25, has H[m, f] = n — 1 for any 



pair u,v. The resulting bound is very close to the precise result. 
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Matthews' Technique for a Subset 



The inequahty (5.6) bounds the cover time of all the vertices of the graph, but 



it applies equally to a subset of the vertices V C V{G): 



Theorem 31 (Matthews' bound, subset version). Let 

H^[V'] = max{H[M,t;] : u,v e V'}, 

where li[u,v] is the hitting time from u to v in G. For a random walk on G 
starting at some vertex v E V , denote by COV^fV] the expected time to visit 
all the vertices of V . Then 

COY,[V'] < H*a[V']h{\V'\), (5.7) 



The notation CO\[V'] shaU mean max^gy/ COVy[V']. 



5.2.4 Lower Bound: Matthews' Technique 

There is also a lower bound version of Matthews' technique. A proof is given in 

Theorem 32 (Matthews' lower bound [62j). For the graph G = {V,E), 

COV[G] > maxH,[A]h{\A\ - 1). 
where HJA] = min^^^g^.^^^i, H[m, f] . 



When using Theorem |32| one needs to be careful with the choice of A; the end 
of a path has hitting time 1 to its neighbour. Thus, if both of these vertices are 
included in A, the bound is no better than logn, and of course, cover time is at 
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least n. 

A refinement on Theorem 32 was given in [71] . We shall quote it as in [3Z 
this is simpler notation. 



smce 



Lemma 33 Let S be a subset of vertices of G and let t be such that for 

all V E S, at most b of the vertices u E S satisfy ii[v,u] < t. Then 

COV4G] >t(log(|5|/6)-2). 



5.3 General Cover Time Bounds 
5.3.1 Asymptotic General Bounds 

In two seminal papers on the subject of cover time, Feige gave tight asymptotic 
bounds on the cover time. As usual, the logarithm is base-e unless otherwise 
stated. 



Theorem 34 ([37]). For any graph G on n vertices and any starting vertex u 

COV„[G] > (l + o(l))nlogn 

This lower bound is exhibited by Kn the complete graph on n vertices, as was 



demonstrated in Theorem 25 It is proven using Lemma 33 It is shown that at 



least one of the two conditions hold for any graph. 

1. There are two vertices u and v such that H[M,t>] > nlogn and H[f,-u] > 
nlogn. 



2. The assumptions of Lemma 33 hold with parameters 15*1 > ^/(loggn) 



b < (log2n)'^ and t > n{l — c/log2n) for some constant c independent of 
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n. 

The upper bound is as follows 

Theorem 35 ([36]). For any connected graph G on n vertices 

CyCOV[G]<(l + o(l))^n3 (5.8) 

The quantity CyCOV[G'] is the cyclic cover time, which is the expected time it 
takes to visit all the vertices of the graph in a specified cyclic order, minimised 
over all cychc orders. Clearly, COY[G] < CyCOY[G]. 

Consider the lollipop graph, which is a path of length ri/3 connected to a complete 
graph of 2?T,/3 vertices. Let u be the vertex that connects the clique to the path, 
and V be the vertex at the other end of the path. It can be determined that 
H[M,f] = COM['u,t'] — H[f,M] = (1 + o(l))^n^. This can be seen by applying 



Theorem 22 to get COM[u, v] and Theorem[26]to get H[f , u]. This demonstrates 
that the asymptotic values of maximum hitting time and cyclic cover time (and 
therefore also cover time) can be equal (up to lower order terms) even when the 
cyclic cover time is maximal (up to lower order terms). 



Theorem 35 is proved by a contradiction argument on the minimum effective 



resistance spanning tree (see section 5.2.2). A trade off is demonstrated 



between w{T^), the sum of effective resistances of edges of T*, and m, the number 



of edges of the graph. This bounds the cover time as per Theorem 29 
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Chapter 6 



The Cover Time of Cartesian 
Product Graphs 

In this chapter, we study the cover time of random walks on on the Cartesian 
product F of two graphs G and H. In doing so, we develop a relation between 
the cover time of F and the cover times of G and H. When one of G or if is in 
some sense larger than the other, its cover time dominates, and can become of 
the same order as the cover time of the product as a whole. Our main theorem 
effectively gives conditions for when this holds. The probabilistic technique 
which we introduce, based on a quantity called the blanket time, is more general 
and may be of independent interest, as might some of the lemmas developed in 
this chapter. The electrical network metaphor is one of the principle tools used 
in our analysis. 

G and H are assumed to be finite (as all graphs in this thesis are), undirected, 
unweighted, simple and connected. 
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6.1 Cartesian Product of Graphs: Definition, 
Properties, Examples 

6.1.1 Definition 

Definition 13. Let G = (VcEg) and H = {Vh,Eh) be simple, connected, 
undirected graphs. The Cartesian product, GOH of G and H is the graph F — 
{Vp, Ep) such that 

(i) Vf^VgX Vh 

(ii) ((a, x), (6, y)) e Ep if and only if either 

1. (a, b) e Eg and x = y, or 

2. a — b and {x, y) e Eh 

We call G and H the factors of F, and we say that G and H are multiplied 
together. 

We can think of F = GOH in terms of the following construction: We make a 
copy of one of the graphs, say G, once for each vertex of the other, H. Denote 
the copy of G corresponding to vertex x e Vh by G^- Let denote a vertex in 
Gx corresponding to a G Vq- If there is an edge {x,y) e Eh, then add an edge 
{axjtty) to the construction. 

Notation For a graph F = (^r, Er), denote by 

(i) ur the number of vertices |Vr|, and 

(ii) mr the number of edges \Ey\. 

In addition we will use use the notation N and M to stand for np and mp 
respectively. 
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6.1.2 Properties 

Commutativity of the Cartesian Product Operation For a pair of graphs 
G and iJ, GOH is isomorphic to HOG; that is, if vertex labels are ignored. 



the graphs are identical. Note, however, that by (i) of Definition 13, the two 



different orders on the product operation do produce different labellings. 

Vertices and Edges of the Product Graph The number of vertices and 
edges of a Cartesian product is related to the vertices and edges of its factors as 
follows: 

(i) N = ugUh- 

(ii) M = nGtriH + nHmc- 

(i) follows from the properties of Cartesian product of two sets, and (i) of Def- 



inition 13 To see (ii), we have by (ii)-l of Definition 13 the following: For a 
vertex x G Vh, there is, for each (a, 6) G Eq, an edge ((a, x), {b,x)) G Ep. That 
is, we have the set 



Sh,x = {{{a, x), (6, x)) : (a, b) G Eg} 



Similarly, we have by (ii)-2 of Definition 13 the following: For a vertex a G Vg, 
there is, for each {x,y) G E^, an edge ((a, x), {a,y)) G Ep. That is, we have the 
set 



Thus, 



Sca = {{ia,x), {a,y)) : {x,y) G Eh} 



Ef= [j Sh,x U U Sca. 



Now \Shx\ = for all x G Vh, and ISq a\ = f^H for all a G Vq, and since the 
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sets are all disjoint, we have 



M = \Ef\ = 



U Sh,. U U SG,a 



U ^0,a 
aeVc 



xeVH 

xeVn aeVa 



Associativity of the Cartesian Product and a Generalisation to an 
Arbitrctry Number of Factors We can extend the definition of the Cartesian 

product to an arbitrary number of factors: {GiOG2)OGs) )OGr- If wc 

always represent the resulting product vertices by r-tuples and edges by pairs of 
r-tuples, then any bracketing in which a bracket contains a product of two graphs 
(either of which may be a product itself) will give the same product, that is, the 
Cartesian product is associative. We can therefore represent it unambiguously 
by F = G1OG2 . . . OGr- This assumes the order of the operands is kept the 
same. If the order is permuted, the tuple representing the vertex labelling will 
be permuted in the same way, but the two permutations will produce isomorphic 
products. 

For a natural number d, we denote by G^ the d'th Cartesian power, that is, 
G"^ = G when d = 1 and = G'^-^nG when d>l. 



6.1.3 Examples 

We give examples of Cartesian product of graphs, some of which are important 
to the proofs of this chapter. First, wc remind the reader of some specific classes 
of graphs, and define new ones: Let P„ denote the n-path, the path graph of 
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n vertices. Let represent the n-cycle, the cycle graph with n vertices. The 
Cartesian product of a pair of paths, PpOP^ is a p x g rectangular grid, and 
when p = q = n, is a.nxn grid, or lattice. The product of a pair of cycles 
ZpDZg is a toroid, and when p = g, is a torus. Both grids and toroids can be 
generalised to higher powers in the obvious way to give d— dimensional grids and 
toroids respectively, where d is the number of paths or cycles multiplied together, 
respectively. 

To give another - somewhat more arbitrary - example, a pictorial representation 



of the product of a triangle graph with a tree is given in Figure 6.1 



A □ ^ — ' 



y 




Figure 6.1: Cartesian product of a triangle with a tree. 



6.2 Blanket Time 

We introduce here a notion that is related to the cover time, and is an important 
part of the main theorem and the proof technique we use. 

Definition 14 ([72]). For a random walk Wu on a graph G = {V, E) starting at 
some vertex u & V , and 5 G [0, 1), define the the random variable 

Bs,u[G] = min{t : Vf G V, N^{t) > STC^t}, (6.1) 

where Ny{t) is the number of times Wu has visited v by time t and vr^ is the 
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stationary probability of vertex v. The blanket time is 

Bs[G]=maxE[Bs,u[G]]. 

u&V 

The following was recently proved in [3T] . 

Theorem 36 ([31]). For any graph G, and any 5 G (0, 1), we have 

B^IG] < k{5)COY[G] (6.2) 

Where the constant n{6) depends only on 5. 
We define the following 

Definition 15 (Blanket-Cover Time). For a random walk >V„ on a graph 
G = {V, E) starting at some vertex u & V , define the the random variable 

^u[G] = mm{t : \/v G V N^{t) > 7r^COY[G]}, 

where Nv{t) is the number of times Wu has visited v by time t and iXy is the 
stationary probability of vertex v. The blanket-cover time is the quantity 

BCOV[G] =maxE[/3„[G]]. 

Thus the blanket-cover time of a graph is the expected first time at each vertex v 
is visited at least 7r^COV[G'] times - which we shall refer to as the blanket-cover 
criterion. 

In the paper that introduced the blanket time, [72], the following equivalence 
was asserted, which we conjecture to be true. 

Conjecture 1. BCOY[G] =0{COY[G]). 
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In the same paper, this equivalence was proved for paths and cycles. However, 
we have not found a proof for the more general case. It can be shown without 
much difficulty that BCOV[G] = 0((COV[G])2). Using the following lemma, 
we can improve upon this. 

Lemma 37 ([53]). Let i and j be two vertices and k > 1. Let Wk be the number 
of times j had been visited when i was visited the k-th time. Then for every 
e > 0, 

Pr f W^, < (1 - ef-^k] < exp ( -—^^—] . 

We use it thus: 



Lemma 38. 

BCOV[G] = O ((log n) COM* [G]) 
where COM*[G] = max^^^^vic) COM[u,v]. 

Proof At time t some vertex i must have been visited at least Hit times, 
otherwise we would get t = "^^^y N^it) < '^^^y'n'vt = t, where Ny(t) is the 
number of times v has been visited by time t. 

We let the walk run for r = v4(log n)COM* [G] steps where A is a large constant. 



Some vertex i will have been visited at least vTjr times. Now we use Lemma [37 
with k = TTiT. Then for any j, 

Pr (wk < (1 - e)'^k] < exp ^ 



71, J - " V47r.COM[^,j] 

f —e'^Alogn 

< exp(^ 

< l/n" 

for some constant c > 1. Hence with probability at most l/n'^~^ the walk has 
failed to visit each vertex j at least vrjCOVfC] times (by Matthews' bound. 
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Factors 



Theorem 30). We repeat the process until success. The expected number of 
attempts is 1 + 0{n^~'^). 



□ 



6.3 Relating the Cover Time of the Cartesian 
Product to Properties of its Factors 

Notation For a graph F, denote by: 6^ the minimum degree; the average 
degree; Ar the maximum degree; Dr the diameter. 

The main theorem of this chapter is the following. 

Theorem 39. Let F = {Vf,Ef) = GUH where G = {Vg,Eg) and H = 
{Vh, Eh) are simple, connected, unweighted, undirected graphs. We have 



COV[F] > max | (^1 + COY[H], (^1 + COV[G]| . 

Suppose further that rin > Dq + 1, then 



(6.3) 



COV[F] <K (^(^1 + BCOV[i7] + 



COV[H]Dg 



(6.4) 



where M = \Ef\ = riGrriH + nnrriG, i = \og{DG + 1) log(?2G-DG) and K is some 
universal constant. 



The main part of the work is the derivation of (6.4); the inequality (6.3) is 



relatively straightforward to derive. Note, by the commutativity of the Cartesian 



product, G and H in the may be swapped in (6.4), subject to the condition 
nG>DH + 1. 



Theorem 39 extends much work done on the particular case of the two-dimensional 
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Factors 



toroid on v? vertices, that is, = Z„nZ„, culminating in a result of [29], which 
gives a tight asymptotic result for the cover time of Z^ as n — )■ oo. Theorem 



39 also extends work done in [52j on powers of general graphs G, which 
gives upper bounds for the cover time of powers of graphs. Specifically, it shows 
COV[G2] = OiQc'^W^) and for d > 3, COV[G'^] = 0(^gA^ log A^). Here 
N = n'^, is the number of vertices in the product, and = 2\E\/n is the aver- 



age degree of G. A formal statement of the theorem is given in section 6.4 and 



further comparisons made in section 6.5 



To prove the Theorem 39, we present a framework to bound the cover time of 
a random walk on a graph which works by dividing the graph up into (possibly 
overlapping) regions, analysing the behaviour of the walk when locally observed 
on those regions, and then composing the analysis of all the regions over the 
whole graph. Thus the analysis of the whole graph is reduced to the analysis of 
outcomes on local regions and subsequent compositions of those outcomes. This 
framework can be applied more generally than Cartesian products. 

Some of the lemmas we use may be of independent interest. In particular. 



Lemmas 47 and 48 provide bounds on effective resistances of graph products 



that extend well-known and commonly used bounds for the n x n grid. 



The lower bound in Theorem 39 implies that COVfCDi/] > COV[iJ] (and 
COY[GOH] > COY[G]), and the upper bound can be viewed as provid- 
ing conditions sufficient for COV[Gni7] = 0{BCOV[H]) (or CO\[GnH] = 
0(BCOV[G])). For example, since paths and cycles have BCOV[G] = e(COV[G]), 
then COV[ZpnZg] = e(COV[Zg]) = Q{q^) subject to the condition plog^p = 
0{q). Thus for this example, the lower and upper bounds in Theorem 39 are 
within a constant factor. 



Before we discuss the proof of Theorem 39 and the framework use to produce it 



we discuss related work, and give examples of the application of the theorem to 
demonstrate how it extends that work. 
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6.4 Related Work 

A d-dimensional torus on N = n'^ vertices is the (i'th power of an n-cycle, Z^. 
The behaviour of random walks on this structure is well studied. 

Theorem 40 (see, e.g., [57]). (i) COV[Zl] = QiNhg^N). 
(ii) COV[Z^] = 0(A^logA^) when d>3. 

In fact, there is a precise asymptotic value for the 2-dimensional case. 
Theorem 41 ([29j). COVfZ^] ~ iA^log^A^. 

The following result of [52] gives bounds on the cover time for powers of more 
general graphs: 

Theorem 42 ([52], Theorem 1.2). Let G = {V,E) be any connected, finite graph 
on n vertices with Oq = 2\E\/n. Let d > 2 be an integer and let N = n'^. For 
d = 2, COV[G"^] = OieaNhg^N) and for d > 3, COV[G'^] = OiecNlogN). 
These bounds are tight. 

[52] does not address products of graphs that are different, nor does it seem 
that the proof techniques used could be directly extended to deal with such 
cases. Our proof techniques are different, but both this work and [52] make use 
of electrical network theory and analysis of subgraphs of the product that are 
isomorphic to the square grid P^DP^. 

A number of theorems and lemmas related to random walks and effective re- 
sistance between pairs of vertices in graph products are given in [13]. To give 
the reader a flavour we quote Theorem 1 of that paper, which is useful as a 
lemma implicitly in this paper and in the proof of [22] Theorem 1.2 to justify 
the intuition that the effective resistance is maximised between opposite corners 
of the square lattice. 
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Lemma 43 ([13], Theorem 1). Let Pn he an n-vertex path with endpoints x and 
y. Let G be a graph and let a and b be any two distinct vertices of G. Gonsider the 
graph G x P„. The effective resistance R((a,x),(b,v)) is maximised over vertices 
V of Pn atv = y. 

For this is used twice: 

P((0, 0), (r, s)) < P((0, 0), - 1, s)) < P((0, 0), - 1, n - 1)). 



6.5 Cover Time: Examples and Comparisons 



In this section, we shall apply Theorem 39 to some examples and make compar- 
isons to established results. 



6.5.1 Two-dimensional Torus 



We shall apply the upper bound of Theorem 39 , to the 2-d torus, Z^: 
{i) G = H = Z„; 

(ii) Az„ = 5z„ = 2; 

(iii) mz„ = nz„ = n; 

(iv) /^.„ = LfJ; 

(v) Thus M = 2mz„n: 

(vi) I 

(vii) By Theorem 



= 2n^, and 

log(DG + l)log(nGZ^G) = log(LfJ +l)log(nLfJ). 
COViZJ = 



27 



(viii) BCOV[G] = e(COV[G]). 
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COV[F] < K 

= o(n^ 



2n^nnni'^ 



= 0{n'f) 
= O (n^ log'^ n) 
= 0{N\og'^N). 

This is a factor log^ out of the actual value - log^ of Theorem 



41 



Theorem 



42 



gives O(A^log^A^) bound. 



6.5.2 Two-dimensional Toroid with a Dominating Factor 



Theorem 39 does not cope well with squares, for which Theorem 42 provides 
strong bounds. Instead it is more effectively applied to cases where there is 
some degree of asymmetry between the factors. The previous example bounded 
COV[ZpnZq] for the case where p = q- If, however, plog^p = 0(g), then we 
get a stronger result. 

(i) G = Z„ 

(ii) H = Z, 

(iii) Ag = Ah = Sg = Sh = 2; 

(iv) BCOV[H] = 0(COV[i7]). 

(v) mG = nG = p; 

(vi) niH = nH = q; 

(vii) Dg=[1\. 
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(viii) Thus M = 2pq, and 

(ix) £ = log(L|J+l)log(pL|J)- 

(x) By Theorem 
Thus, 



27 



COViZ. 



p(p-i) 



2 and COV[Zj 



2 • 



2pqpqqi'^ 
q^p 

O {<f + pqlog^ p^ 



if plog'^p = 0(g). 



Comparing this to the lower bound of Theorem 39 



COV[F] > max 
which implies 



^ + ^) COY[H], + 1 ) COV[G] 



cov[F] = n (^(^-^ + 1^ coy[H]^ = n{q^ 



Thus, Theorem 39 gives upper and lower bounds within a constant a multiple for 
this example. That is, it tells us COV[ZpnZg] = e(COV[Zg]) = e(g2) subject 
to the condition plog^p = 0{q). Looking at it another way, it gives conditions 
for when the cover time of the product F = GOH is within a constant multiple 
of the cover time of one of it's factors. We describe that factor as the dominating 
factor. 
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6.6 Preliminaries 

6.6.1 Some Notation 

For clarity, and because a vertex u may be considered in two different graphs, 
we may use dc^u) to explicitly denote the degree of u in graph G. 

h{n) denotes the ra'th harmonic number, that is, h{n) = Yl^=i lA- Note h{n) = 
logn + 7 + 0{l/n) where 7 ~ 0.577. All logarithms in this chapter are base-e. 

In the notation (.,?/), the '.' is a place holder for some unspecified element, 
which may be different from one tuple to another. For example, if we refer to 
two vertices (.,a), (.,6) G GDH[S], the first elements of the tuples may or may 
not be the same, but (.,a), for example, refers to a particular vertex, not a set 
of vertices {{x,a) : a G V{G)}. 

6.6.2 The Square Grid 

The k X k grid graph where is the fc-path, plays an important role in our 
work. We shall analyse random walks on subgraphs isomorphic to this structure. 
It is well known in the literature (see, e.g. [SZ]) that for any pair of vertices 
M,f G V{P^), we have R{u,v) < Clog A; where C is some universal constant. 
We shall quote part of [52] Lemma 3.1 in our notation and refer the reader to 
the proof there. 

Lemma 44 ([52], Lemma 3.1(a)). Let u and v he any two vertices of P^. Then 
R{u,v) < 8h{k), where h{k) is the k'th harmonic number. 
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6.7 Locally Observed Random Walk 

Let G = {V,E) be a connected, unweighted (equiv., uniformly weighted) graph. 
Let S G V and let G[S] be the subgraph of G induced by S. Let B = {v G 
S : 3x ^ S, {v,x) G E}. Call B the boundary of S. and the vertices of V \ 5* 
exterior vertices. If v E S then dciv) (the degree of v in G) is partitioned into 
d{v,in) = \N{v,in)\ = \N{v)nS\ and d{v,out) = \N{v,out) \ = \N{v)n{V\S)\, 
{inside and outside degree). Here N{v) denotes the neighbour set of v. 

Let u,v G B. Say that u,v are exterior- connected if there is a {u,v)-paih 
u,xi, ...Xk,v where Xi G V\S,k > 1. Thus all vertices of the path except u, v are 
exterior, and the path contains at least one exterior vertex. Let A{B) — {{u, v) : 
u,v are exterior-connected }. Note A{B) may include self-loops. 

Call edges of G[S] interior, edges of A{B) exterior. We say that a walk u) — 
{u, xi, ...Xk, v) on G is an exterior walk if u,v G S and Xi ^ S, 1 < i < k. 

We derive a weighted multi-graph H from G and S as follows: V{H) — S, 



E{H) = E{G[S])UA{B). Note if li,^; G B and {u,v) G then {u,v) G E{G[S]), 



and if, furthermore, u,v are exterior connected, then {u,v) G A{B) and these 
edges are distinct, hence, H may not only have self-loops but also parallel edges, 
i.e., E{H) is a multiset. 

Associate with an orientation (u'^v) of an edge {u, v) G A{B) the set of all exterior 
walks uj — {u, xi, ...Xk, v), k > 1 that start at u and end at v, and associate with 
each such walk the value p{uj) = 1 / {dG{u)dG{xi) . . .d^x k)) (note, the d{xi) is not 
ambiguous, since Xi ^ E{H), but we leave the 'G" subscript in for clarity). This 
is precisely the probability that the walk uu is taken by a simple random walk on 
G starting at u. Let 






(6.5) 



k>l ijj={u,xi...Xk,v) 
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where the sum is over all exterior walks uj. 

We set the edge conductances (weights) of H as follows: If e is an interior edge, 
c(e) — 1. If it is an exterior edge e = (w, v) define c(e) as 



Thus the edge weight is consistent. A weighted random walk on H is thus a 
finite reversible Markov chain with all the associated properties that this entails. 

Definition 16. The weighted graph H derived from {G, S) is termed the local 
observation of G at S, or G locally observed at S. We shall denote it as H — 



The intuition in the above is that we wish to observe a random walk W{G) on a 
subset S of the vertices. When W{G) makes an external transition at the border 
B, we cease observing and resume observing if/when it returns to the border. 
It will thus appear to have transitioned a virtual edge between the vertex it left 
off and the one it returned on. It will therefore appear to be a weighted random 
walk on H. This equivalence is formalised thus 

Definition 17. Let G be a graph and S C V{G). For an (unweighted) random 
walk yV{G) on G starting at xq G S, derive the Markov chain Ai{G,S) on 
the states of S as follows: (i) A4{G,S) starts on Xq (ii) If W{G) makes a 
transition through an internal edge {u,v) then so does A4{G,S) (iii)If W{G) 
takes an exterior walk ui — {u,xi...Xk,v) then A4{G,S) remains at u until the 
walk is complete and subsequently transitions to v. We call A4{G,S) the local 
observation of W{G) at S, or W(G) locally observed at S. 
Lemma 45. For a walk yV{G) and a set S C V{G), the local observation of 
W{G) at S, A4{G,S) is equivalent to the weighted random walk yV{H) where 





1 



dG{v)pH{vTu) (6.6) 



k>l ui={u,xi...Xk,v) 




Loc{G,S). 



H^Loc{G,S). 
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Proof The states are clearly the same so it remains to show that the tran- 
sition probability Pm{ij,v) from u to v in Ai{G, S) is the same as Py^;(^H){u,v) in 
yV{H). Recall that B is the border of the induced subgraph G[S]. If u ^ B then 
an edge [u, v) G E[H) is internal and so has unit conductance in /J, as it does in 
G. Furthermore, for an internal edge e, e G E{H) if and only if e G E(G), thus 
duiu) = dciu) when u ^ B. Therefore Py^(^H){u,v) = l/dniu) — l/daiu) — 
Pm{u,v). 

Now suppose u e B. Let E{u) denote the set of all edges incident with u in 
H and recall A{B) above is the set of exterior edges. The total conductance 
(weight) of the exterior edges at u is 

Cij(e) = ^ Pr(walk from x returns to B at v) 

eeE{u)nA{B) xeN(u,out) veB 

= E 1 

x&N{u,out) 

— d{u,out). 

(Note the H subscript in c//(e) above is redundant since exterior edges are only 
defined for H, but we leave it for clarity). 

Thus for ii G -B 

ch{u) = J2 ^H{e) = 5^ 1 + 5^ ^H{e) 

eeE{u) eeE{u)nG[S] eeE{u}nA{B) 

— d{u, in) + d{u, out) 
= dciu) 



Now 



Pm{u,v) = l{^u,v)eG[S]}^-^ + y^ , / X , / "^x , / X (6.7) 
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where the sum is over all exterior walks u). Thus 

1 



Pm{u,v) = '^{{u,v)eG[s]}j—r^ +PHiu7v) (6. 

dG[u) 



PW{H){U,V) = ^ [l|(„_„)eG'[S]} + l{(«,i>)GA(i?)}C//(M,^^)] (6.9) 

[^{{u,v)eG[s]} + '^{iu,v)eA(B)}dG{u)pH{u7v)] (6.10) 



dciu, 

l{(«,D)eG[5]}^^^^^ + '^{iu,v)(^A{B)}PH{u7v) (6-11) 

Pm{u,v) (6.12) 

□ 



6.8 Effective Resistance Lemmas 



For the upper bound of Theorem 39 we require the following lemmas. 



Lemma 46. Let G be an undirected graph. Let G' C G, be any subgraph such 
that such that V{G') = V{G). For any u,v e V{G), 

R{u,v) < R'{u,v) 

where R{u,v) is the effective resistance between u and v in G and R'{u,v) sim- 
ilarly in G' . 

Proof Since V{G') = V{G), G' can be obtained from G by only removing 



edges. The lemma follows by the Cutting Law (Lemma 20). □ 



Denote by Rmax{G) the maximum effective resistance between any pair of ver- 
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tices in a graph G. 

Lemma 47. For a graph G and tree T, Rmax{GOT) < ARmaxiG^Pr) where 
\V{T)\ < r < 2|\^(T)| and Pr is the path on r vertices. 

Proof 

Note first the following: 

(i) By the parallel law, an edge (a, b) of unit resistance can be replaced with 

two parallel edges between a, b, each of resistance 2. 

(ii) By the shorting law, a vertex a can be replaced with two vertices ai, 02 with 

a zero-resistance edge between them and the ends of edges incident on a 
distributed arbitrarily between oi and 02- 

(iii) By the same principle of the cutting law, this edge can be broken without 
decreasing effective resistance between any pair of vertices. 

Transformations (i) and (ii) do not alter the effective resistance R{u, v) between 
a pair of vertices u, v in the network. For any vertex u ^ {ai, 02, a}, R{u, ai) = 
R{u, 02) and these are equal to R{u, a) before the operation. 

Points (ii) and (iii) require elaboration. In this thesis, we do not define zero- 



resistance (infinite conductance) edges. As stated in section 4.4.1, to say that 



a zero- resistance edge is placed between ai and 02, is another way of referring 



to shorting as defined in Lemma 21, It would seem then, that (ii), in fact says 
nothing. However, it serves as a useful short hand for talking about operations 
on the graph when used in conjunction with (iii). If (ii) and (iii) are always 
used together, that is, if a zero-resistance edge created from (ii) is always cut 
by (iii), then this is equivalent to the reverse of process of shorting two vertices 



ai and 02 into 03, as per Lemma [21} Hence, these two operations together are 
sound. 

We continue thus: 
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1. Let F = Got. Let each edge of F have unit resistance. In what follows, 
we shall modify F, but shall continue to refer to the modified graphs as F. 

2. Starting from some vertex v in T, perform a depth-first search (DPS) of T 
stopping at the first return to v after all vertices in T have been visited. 
Each edge of T is traversed twice; once in each orientation. Each vertex x 
will be visited d{x) times. 

3. Let (cj) be the sequence of oriented edges generated by the search. The 
idea is to use (cj) to construct a transformation from F = GOT to GOP^.. 
From (cj), we derive another sequence (aj), which is generated by following 
(cj) and if we have edges ej,ej+i with Cj = (a, 6), Cj+i = {b,c) such that 
it is neither the first time nor the last time b is visited in the DFS, then 
we replace ej,ej_|_i with (a,c). We term such an operation an aggregation. 
Observe that in the sequence (oj), all leaf vertices of T appear only once 
(just as in (cj)), and a non-leaf vertex appears twice. 

4. By (i) above, we can replace each (unit resistance) edge in F by a pair of 
parallel edges each of resistance 2. 

5. For a pair of parallel edges in the T dimension, arbitrarily label one of 
them with an orientation, and label the other with the opposite orienta- 
tion. Note, orientations are only an aid to the proof, and are not a fiow 
restriction. We therefore see that (ej) can be interpreted as a sequence of 
these parallel oriented edges. 

6. We further modify F using (oj): If (a, 6),(6, c) was aggregated to (a,c), 
then replace each pair of oriented edges {{x,a), {x,b)) and ((x, 6), {x,c)) 
in F with an oriented edge {{x, a), {x, c)). The resistances of {{x, a), {x, b)) 
and ((x, b), {x, c)) were r((x, a), {x, b)) = 2 and r((x, 6), (x, c)) = 2. Set the 
resistance r((x, a), {x, c)) = r((x, a), {x, b)) -\- r{{x, b), {x, c)) = 4. 

7. The above operation is the same as restricting fiow through ((x, a), (x, b)) 
and {{x , b) , {x , c)) to only going from one to the other at vertex {x,b), 
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without the possibihty of going through other edges. The infimum of the 
energies of this subset of flows is at least the infimum of the energies of the 
previous set and so by Thomson's principle, the effective resistance cannot 
be decreased by this operation. 

8. For each copy of G in F excluding those that correspond to a leaf of 
T, we can create a "twin" copy G[. Associate with each vertex x e V{F) 
(except those excluded) a newly-created twin vertex x' with no incident 
edges. Thus, V{Gi) has a twin set V{G'j), though the latter has no edges 
yet. 

9. Recall the parallel edges created initially from all the edges of F\ wc did 
not manipulate those in the G dimension, but wc do so now: redistribute 
half of the parallel edges of Gi in the G dimension to the set of twin vertices 
V{G'^ so as to make G'^ a copy of G (isomorphic to it). Now put a zero- 
resistance edge between x and x' . By (ii), effective resistance is unchanged 
by this operation. 

10. We now redistribute the oriented parallel edges in the T dimension so as 
to respect the sequence (oj). We do this as follows: follow the sequence 
(cj) by traversing edges in their orientation. Consider the following event: 
In the sequence (oj) there is an element aj — {a,b) and b has appeared in 
some element Oj such that i < j. Then Uj is the second time that b has 
occurred in the sequence. Now change each edges ((x, a), (x, 6)) e F to 
{{x, a), {x, b)'). lib = V, then stop; otherwise, aj is followed by Oj+i = (6, c), 
for some c G V{T). In this case, also change all {{x,b),{x,c)) e F to 
{{x,by,{x,c)). Continue in the same manner to the end of the sequence 
(ai). 

11. We then remove the zero-resistance edges between each pair of twin ver- 
tices, and by (iii), this cannot decrease the effective resistance. 

Using the sequence (oj) to trace a path of copies of G, we see that the resulting 
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structure is isomorphic to G^Pr. Since the aggregation process only aggregates 
edges that pass through a previously seen vertex, r is at least |\^(T)|. Also, be- 
cause each edge is traversed at most once in each direction, r is at most 2|V(T)|. 
Each edge has resistance at most 4, and so the lemma follows. □ 

Lemma 48. For graphs G, H suppose Dq + 1 < uh < ol{Dg + 1), for some a. 
Then Rmax{GOH) < (alog^Dc + 1), where ( is some universal constant. 

Proof Let (a, x), {b, y) be any two vertices in GOH. Let D be some diamet- 
ric path of G. Let (a, D) represent the shortest path from a to D in G (which may 
trivially be a if it is on D). Similarly with (6, D). Let = D U {a, D) U {b, D). 
Let k = Dc + 1. Note k < \V(Td)\ < 3k. Now let Th be any spanning tree of 



H. Applying Lemma 47 twice we have 



where k < r < Qk and k < s < 2ak. Considering a series of connected P^ sub- 



graphs and using Lemma 44 and the triangle inequality for effective resistance, 
we have Rmax{Pr^Ps) < 16(6 -|- 2a)8/;,(/c), where h{k) is the fc'th harmonic num- 
ber. Since T^OTh C GOH, the lemma follows by Lemma 46 □ 



A diametric path D is involved in the proof of Lemma 48 because the use of D 
means that the dimension of Pr is effectively maximised, and we can break up 
the grid P,.nPs roughly into k x k square grids, each with maximum effective 
resistance 0(logA;) = OilogDc)- If, for example, the shortest path between a 
and b is used, the product P^DP^ may have r much smaller than s, looking like 
a long thin grid, which may have a high effective resistance. 



6.9 A General bound 



In this section, we prove Theorem 39 , starting with the lower bound 
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6.9.1 Lower Bound 



The following is a partial restatement of Theorem 39 for the lower bound (in- 



equality (6.3)). 



Theorem 39 (partial restatement) Let F = (Vp, Ep) = GOH where 
G = (Vg, Eg) and H = (Vh, Eh) are simple, connected, unweighted, undirected 
graphs. We have 

COV[F] > max | (^1 + COY[H], (^1 + COV[G]| . (6.13) 



Proof In order for the walk W to cover F, it needs to have covered the 
H dimension of F. That is, each copy of G in F needs to have been visited at 
least once. The probability of a transition in the H dimension is distributed as 

-. Thus, the 



a geometric random variable with success probability at most ■ 
expectation of the number of steps of W per transition in the H dimension is 
at least Transitions of W in the H dimension are independent of the 

location of W in the G dimension, and have the same distribution (in the H 
dimension) as a walk on H. This proves 



COV[F] > 1 + 



5, 



G 



A 



H 



COY[H]. 



By commutativity. 



6 



COV[F] > ( 1 + ^ ) COV[G]. 



□ 
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6.9.2 Upper Bound 



The following proves the upper bound in Theorem 39 It is envisaged that theo- 
rem is used with the idea in mind that G is small relative to H, and so the cover 
time of the product is essentially dominated by the cover time of H . We give a 



partial restatement of the theorem for the upper bound (inequality (6.4)). 



Theorem 39 (partial restatement) Let F = (Fp, Ep) = GOH where G = 
{Vgi Eg) and H = {Vh, Eh) are simple, connected, unweighted and undirected. 
Suppose further that riu > -Dg + 1- 

where i = \og{DG + 1) loglncDc) and K is some universal constant. 

Proof Let k = Dq + 1- We group the vertices of H into sets such that 
for any set S and the subgraph of H induced by 5, -^[5"]: (i)|5'| > k, {ii)H[S] 
is connected, (iii) The diameter of H[S] is at most Ak. We do this through 
the following decomposition algorithm on H: Choose some arbitrary vertex v G 
V{H) as the root, and using a breadth-first search (BFS) on H , descend from 
V at most distance k. The resulting tree T{v) C H will have diameter at most 
2k. For each leaf I of T(f ), continue the BFS using / as a root. If T{1) has fewer 
than k vertices, append it to T{v). If not, recurse on the leaves of T{1). The 
set of vertices of each tree thus formed satisfies the three conditions above. The 
root is part of a new set, unless it has been appended to another tree. 

In the product F we refer to copies of G as columns. In F we have a natural 
association of each column with the set 5* C V{H) defined above. We define 
Block[S] = {GnH[S]). 



[Refer to section 6.6.1|for a reminder of the notation (., y)]. For any two vertices 



, a), (., 6) G GOH[S] there exists a tree T(a, b) subgraph of the tree T in H that 
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generated S such that a and b are connected in T(a, b) and k < \V{T{a, b))\ < 4k. 
Then using Lemmas 48 and |46} we can upper bound the effective resistance 
i?((.,a),(.,6)) mB = Block[S], 



(6.15) 



Furthermore, if B' = Loc{F, V{B)) {Loc is defined in Definition 16), then B C B' 



so by Lemma 46 



RmaAB')<^\ogiDG + l). (6.16) 

We use the following two-phase approach to bound the cover time of F = GOH. 

Phase 1 Perform a random walk VV(-F) on F until the blanket-cover criterion 
is satisfied for the H dimension. 

Phase 2 Starting from the end of phase 1, perform a random walk on F until 
all vertices of F not visited in phase 1 are visited. 

Phase 1 can be thought of in the following way: We couple W{F) with a walk 
W{H) such that (i) if W(F) starts at (.,x), then W{H) starts at x, and (ii) 
yV{H) moves to a new vertex y from a vertex x when and only when VV(-F) 
moves from (.,x) to (.,?/). This coupled process runs until W(-ff) satisfies the 
blanket-cover criteria for H, i.e., when each vertex v G V{H) has been visited 
at least tt{v)CO\[H] times. An implication is that the corresponding column 
Gy in F will have been visited at least that many times. 

Having grouped F into blocks, we analyse the outcome of phase 1 by relating 
yV{F) to the local observation on each block. A particular block B will have 
some vertices unvisited by W{F) if and only if }V{F) locally observed on B fails 
to visit all vertices. We refer to such a block as failed. Consider the weighted 
random walk yV{B') on B' = Loc{F,V{B)). This has the same distribution as 
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VV(-F) locally observed on B. Hence, we bound the probability of W(-F) failing 
to cover B by bounding the probability that yV{B') fails to cover B'. Done for 
all blocks, we can bound the expected time it takes phase 2 to cover the failed 
blocks. We think of phase 1 as doing most of the "work", and phase 2 as a 
"mopping up" phase. Mopping up a block in phase 2 is costly, but if there are 
few of them, the overall cost is within a small factor of phase 1. 

We bound Pr(>V(-B') fails) by exploiting the fact that yV{B') will have made 
some minimal number of transitions t. This is guaranteed because phase 1 
terminates only when W{H) has satisfied the blanket-cover criterion on H. If k 
counts the number of steps of a walk VV(-B') until B' is covered, then 

Pt{W{B') fails to covert') < Pr(/s; > t) < (6.17) 
by Markov's inequality. 



Definition 18. For graphs I = JOK, and S C V{I), denote by S.K the pro- 
jection of S on to K, that is, S.K = {f G : (., f ) G 5}. 

For a weighted graph G, recall that c(G) is the twice the sum of the conductances 



(weights) of all edges of G (refer to section 2.1.1). By the definition of G^H[S\ 



and section 6.7 



c{B') <mG\V{B).H\+nG d{u) 

ueV{B).H 



(6.18) 



Using (6.16) and Theorem 22 we therefore have for any m, f G V{B') , COM [u,v] < 
Kc{B') log{DG + 1) for some universal constant K. (In what follows K will 
change, but we shall keep the same symbol, with an understanding that what 
we finish with is a universal constant). Hence, by Matthews' Technique (Theo- 
rem 30), 

COV[B'] < Kc{B')\og{DG + l)\ogi\ViB')\). (6.19) 
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For a block B, the number of transitions on the H dimension - and therefore 
the number of transitions on i? - as demanded by the blanket-cover criterion is 
at least 



ueV{B).H 



(6.20) 



ueV{B).H 

where tth{u) and dniu) denote the stationary probability and degree of u in H. 
Now 



Pr(Vy(F) fails on B) = Pr{W{B') fails on B') 

< Kc{B')\og{DG + l)\og{\V{B')\)/T, (6.21) 



as per (6.17). For convenience, we left Ib = \og{DG + l)\og{\V{B)\) (recall 
V{B) = V{B')). Hence, using ( |6^ with dOS) ) and ( [a2l] ). 



Pr(>V(F) fails on B) < 



KIbttih T^G\ViB).H\ + nGj2u&v(B).Hdiu 



CO\[H] 
COV[H] 



Y.ueV{B).H^H{.u) 

mG\V{B).H\ 

YjueV{B).H'^H{u) 



(6.22) 



Phase 2 consists of movement between failed blocks, and covering a failed block 
it has arrived at. The total block-to-block movement is upper bounded by the 
time is takes to cover the H dimension of F (in other words, for each column to 
have been visited at least once). We denote this by COV f[H]. Let COV f[B] 
denote the cover time of the set of vertices of a block B by the walk >V(-F). Let 
the random variables 0i and 02 represent the time it takes to complete phase 1 
and phase 2 respectively. 
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E[02] < COV^[iJ] + J2 Pr(>V(F) fails on B)COVf[B]. 



B&F 



For W{H), the random variable Ph = min{t : {'iv)N^{t) > 'k{v)COV[H]} 
counts the time it takes to satisfy the blanket-cover criterion on H. 

The expected number of movements on F per movement on the H dimension is 
at most {Ag + Sh)/Sh- Therefore, 

Oh Oh 

Similarly, 

COV^[iJ] < ^g + ^^ cOViiJ]. 

Oh 



Using (6.15), Lemma 46, and Theorems 22 and 31 on B, we have 



COVf[B] < K'c{F)Ib (6.23) 



where c(F) = 2\E{F) \ = 2M. 
Hence, 

COV[F] < E[0i] + E[02] 



< K^^^^BCO\[H] + y Pr(>V(F) fails on B)CO\f[B]. 

Oh — ^ 

BeF 



We have, using (6.22) and (6.23) 



Pr(W(F) fails on B)COVAB] < L + ^^^^ 

BeF ^^^^^JbsfV 2^ueV{B).H 



mG\V{B ).H\ 
(6.24) 
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Since ^ueV{B) h '^(^) — outer summation in (6.24) can be bounded 

thus 

E i^o + ] H < ma\og\Da + l)Y. \og\\Vm)- (6.25) 

B^f\ 2^u€V{B).h(^H[U) j 

Since each block B E F has at least -Dg + 1 columns, we can upper bound the sum 



in the RHS of (6.25) by assuming all blocks have this minimum. The number 



of such blocks in F will be \V{H)\/{Dg + 1), each block having {Dq + 1)^^ 
vertices. Hence 



Y.\og{\Vm? <'^^og\na{Dc + l)). 



(6.26) 



Putting together (6.24), (6.25) and (6.26), we get 



Pr(W(F) faUs on B)COV ^B] < A' ^^^f 



B6F 



where i = log^Dc + 1) iog^ncDc) 



□ 
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Chapter 



7 



Random 



Graphs of a Given 



Degree Sequence 



In this chapter we study the asymptotic cover time of random graphs that have a 
prescribed degree sequence, that is, for a graph with vertex set V — {1,2, ... 
with n — >■ oo, we have a sequence d = {di,d2, . . . , of positive integers where 
di is the degree of vertex i and we wish to determine the cover time of a graph 
picked uniformly at random from the set of all connected simple graphs of de- 
gree sequence d. Denote by ^(d) the space of all such graphs with a uniform 
distribution on them. Thus, we study the cover time of a graph G picked from 
^(d). We may relax our terminology slightly and say that a graph G is picked 
ueir from the set ^(d), even though ^(d) is not a set but a set with an associated 
distribution on the elements. 

It must be noted that not every sequence of n positive integers will allow for a 
simple graph, nor even a graph. We know, for example, for the sequence to be 
graphical, the sum of the degrees has to be even, i.e., XlILi ~ for some 
natural number m. With this condition, a graph having the prescribed sequence 
may exist, but may not be simple. For example, a 2- vertex graph with degree 
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sequence (1, 3) will force multiple edges and/or loops. Nevertheless, the study of 
the types of graphs can be facilitated by means of an intimately related random 



process known as the configuration model, which is explained in section 7.3.1 
and is treated in many sources, including, e.g., [49j. 

We reiterate some definitions: a statement V{n) parameterised on an integer 
n holds with high probability (whp) if Pr{V{n) is true) — )■ 1 as n — > oo; the 
notation f{n) ~ g{n) means f{n)/g{n) — > 1 as n — t- oo. 

Our study of the cover time imposes certain technical restrictions on the degree 
sequence under consideration in addition to the Yl^=i — restriction already 
mentioned. A degree sequence d satisfying these restrictions, which are described 
in section [734 is called nice. 



We denote by 6 the average vertex degree, i.e., 6 = 2m /n, and by d the effective 
minimum degree. The latter is a fixed positive integer, and the first entry in the 
ordered degree sequence which occurs G(n) times, meaning that for any d' < d, 
there are o(n) vertices with degree d'. The significance of the effective minimum 



degree is discussed in 7.4 



Theorem 49. Let G be chosen uar from Q{d), where d is nice. Then whp, 

COV[G]~^^nlogn. (7.1) 

The logarithms are base-e, as they are in the rest of this chapter, unless stated 
otherwise. 

We note that ii d 9, i.e., the graph is pseudo-regular, then 

COV[G]~^nlogn, 
which is the same asymptotic limit for random ci-regular graphs given in 
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7.1 Random Graphs: Models and Cover Time 

The study of random graphs goes back to [31] and [H] and has since become 
a highly active field of research. Those two models have proved to be a highly 
fertile ground in which to develop ideas, tools and techniques for the study of 
random graphs. Despite the fact that these two models were introduced by 
different groups of authors, both of them have come to be referred to as Erdos- 
Renyi (E-R) random graphs, after the authors of 



Subsequent models attempted to deal with the shortcomings of Erdos-Renyi in 
accurately capturing the structural properties of "real world" networks. This 
has been an area where mathematical theory and empirical study have been 
mutually beneficial to each other; theory has served to deepen understanding of 
"real-world" networks, and empirical research has generated data that has led 



to the developments of theory. Section 7.1.3 gives examples 



The classical work on random graphs is [12]. Another popular example is 
but there are many others. 

In the next section, we describe some models of random graphs as well as cover 
time results on them. We start with the two original ones of [3l] and [H]. It 
should be noted, that in all graphs we consider, vertices are labelled, and thus 
two graphs which may be indistinguishable without labelling, will be different 
objects with labelling. 

Before doing so, we define the term graph space. A graph space ^ is a set of 
graphs together with an assignment of probability p{G) to each graph G E Q 
such that TliG&g'P^^) ~ ^-^^ ^ probability distribution. When we say 

we pick a random graph G G ^, we mean we are picking it from the set with 
probability p{G). 
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7.1.1 Erdos-Renyi 

In the model of [34J, the graph space Q{n,m) is the set of all graphs with n 
(labelled) vertices and m edges, together with a uniform distribution on the set. 
Hence, a graph G is picked uniformly at random from all graphs on n vertices 
and m edges. In the model of |41j, there are n (labelled) vertices and each of 
the (2) possible edges exists with some fixed probability p, independently of the 
others. This graph space, denoted by Q{n,p) contains every graph on n vertices, 
but the distribution on them is not uniform. For a particular graph G on n 
vertices, if the edge set E{G) is such that |-E(G)| = m, then the probability of 
G being picked from Q (n, p) is 

p-(l_p)(2) — . 

Note, that whilst these graphs will always be simple, they may not be connected. 
Conditions for connectivity were studied with the introduction of the models, 
and continued to be thereafter, becoming a major area of focus for research on 
these models. 

A cover time result for Q{n,p) was given in [5T] : 

Theorem 50 ([51j). For G E Q{n,p), whp, 

(i) //jf^^oo thenCOV[G] = {l + o{l))nlogn. 

(ii) // c > 1 is a constant and np = clogn then COV[G] > (1 + a)nlogn for 

some constant a = a(c). 

This result was then strengthened in [21] : 

Theorem 51 ([2T]). Suppose that np = clogn = log?i + u where u = {c — 
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1) logra — )■ oo and c = 0(1). If G E Q{n,p), then whp, 



COY[G] ~ clog ( ^— y ]n\ogn. 



c 



7.1.2 Random Regular 

Let r be a positive integer. A random r-regular graph G on n vertices is a graph 
picked uar from the set of of all r-regular graphs on n vertices. The graph space 
is denoted by Q{n,r). 

The cover time for random regular graphs was studied in |20j : 
Theorem 52 ([20]). Let r > 3 be a constant. For G G Q{n,r), whp, 

COVfC] ~ ^-^nlogn. 
r — 2 



7.1.3 Other Models 



The cover time of a particular generative model of the preferential attachment 
graph is studied in [22]. In this model, at each time step, a new vertex v is added 
to the graph, and a fixed number m edges are randomly added between v and 
the existing vertices. The probability of adding to a vertex u is in proportion to 
the degree of u at that time in the process. The cover time was determined to 
be asymptotically equal to log where n is the final number of vertices in 
the graph. 

This model was suggested by [8J as a means of generating graphs with the scale- 
free property, which is the name given in the same paper to graphs having 
a power-law degree distribution. This is one in which P{k) oc k"'^ for some 
constant 7, where P{k), the fraction of vertices with degree k. Such a property 
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is thought to exist in many "real world" networks, such as the WWW and actor 
collaboration networks [S] , and the Internet [33| 

In a random geometric graph ([66]), the n vertices are scattered uar on (some 
subset of) a (i-dimensional space where d > 2, and an edge is placed between 
vertices u and v if the Euclidean distance between them is at most some fixed 
constant r, often called the radius. This type of random graph for d = 2 has 
been used as a model of wireless ad-hoc and sensor networks ([M], [6], |16j ) 
where the radius represents the radio communication range of devices that are 
placed randomly on the plane. Two recent papers deal with the cover time of 
random geometric graphs: |27] and [7]. 



7.2 Mixing Time, Eigenvalues and Conductance 

In this section we discuss parameters that are related to random walks on graphs. 
The concepts introduced here play a fundamental role in the proof Theorem 149} 



In chapter |3} we discussed the convergence of a random walk W on a graph G to 
a unique stationary distribution vr. The formal statement was made in Theorem 
10 which in turn was based on Theorem [7] for Markov chains more general than 



random walks on undirected graphs. 

Although these theorems asserted that a walk would converge to a stationary 
distribution (given certain conditions), there was no mention of how quickly this 
convergence happens. That is, there was no mention of how close to stationarity 
was the distribution of the random walk W after some number of steps t, nor 
how many steps were required to get close to stationarity (for some well-defined 



^The model of [S] was analytically determined by to have P{k) cc k for all k < n^^^^ 
where n is the final number of vertices in the model. This closely matched simulation results 
of [8] and [9] which gave values for 7 of 2.9 ± 0.1. As a comparison, experimental studies for 
the WWW [3] suggests 7 « 2.1 and 7 « 2.45 for the in-degree and out-degree respectively. 
Similarly, experimental studies for the Internet in |35] suggest 7 between 2.15 and 2.20. 



94 



7.2 Mixing Time, Eigenvalues and Conductance 



meaning of "close"). There are a number of related definitions of "closeness" 
between one distribution and another, and also a number of related definitions 
of "quickness" for convergence of distributions, but they are similar enough to 
one another that they all reflect the fundamental behaviour of a walk on the 
graph in roughly the same way. The rate of convergence of the walk is called 
the mixing rate, and the time it takes the walk to get close to stationarity is 
called the mixing time. A walk is rapidly mixing if the mixing time is somehow 
small compared to the size of the graph - say, polylogarithmic in the number of 
vertices. We shall define these notions in precise terms (see, e.g., or 
but first we shall introduce the role of eigenvalues in the study of random walks 
on graphs. 



7.2.1 Theory and Application of the Spectra of Random 
Walks 

Recall that for the matrix P of transition probabilities of a random walk on a 
graph G, the t-step probabilities are given by P*. This suggests that the set of 
eigenvalues of P - the spectrum - and their associated eigenvectors may have some 
important role. Indeed, we have already seen one particularly important left- 
eigenvector, the stationary distribution vr: ttP = tt. Note further that PI = 1, 
where 1 is the column vector of n elements with every element 1. 

The spectral theory of the transition matrix allows us to prove a convergence in 
distribution, and as we shall see, also give bounds on how close the distribution 
at time t is to the limit give some starting distribution p. This is given in terms 
of a distance between pP* and vr, bounded by a function of eigenvalues and t. 

We follow the presentation given in [59]. Recall the definition of the adjacency 



matrix given in section 2A_ Let A be the adjacency matrix and P be the 
transition matrix for a connected, simple, undirected and unweighted graph G. 
Let n = \V{G)\, and without loss of generality, let us assume the vertices of G 
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are labelled 1 to n. Let D be the diagonal nx n matrix such that Dj^j = l/d{i), 
the degree of vertex i. Observe P = DA. 

Although A is symmetric, P will not be unless G is a regular graph. In order to 
use the powerful tools of Spectral Theory, we require a matrix to be symmetric. 
We therefore use the related matrix N = D^/^AD^/^ = D^-^/^PD^/^, which is, 
in fact, symmetric. 

Proposition 4. For a real, symmetric nxn matrix 'M., by the Spectral Theorem, 

1. Eigenvectors ofM. with different eigenvalues are orthogonal; eigenvectors 
with the same eigenvalue need not be. 

2. M has a full orthonormal basis of eigenvectors vi, V2, . . . , Vn, with corre- 
sponding eigenvalues Ai, A2, . . . , A„. All eigenvalues and eigenvectors are 
real. 

3. M is diagonalisable: 

M = EAE^ 

where the columns of E are the orthonormal basis Vi, V2, . . . , Vn, and A 
is a diagonal matrix with entries corresponding to the eigenvalues of the 
columns of E (in corresponding order). Thus, M can be expressed in the 
following form: 

n 

M = J]A,ViVi^. 

i=l 

See, for example, for details. 

Since N is real and symmetric, then by Proposition |4] it has the form 

n 

N = ^ A,v,vf , 

where Ai > A2 > . . . A„ and the Vi form an orthonormal set. Consider the column 
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vector w with Wj = ■>/ d{i), where d{i) is the degree of vertex i. Observe 

Nw = D-^/2pj3i/2^ 
= D i/2pi 
= D-i/^l 
= w. 

Thus w is an eigenvector of N with size = d{i) = \/2m, where 

m = \E{G)\ is the number of edges in G. Since this eigenvector is positive, by 
the Perron-Frobenius theorem, the eigenvalue associated with it is unique and 
strictly larger than the second largest eigenvalue. Furthermore, it is at least the 
absolute size of the smallest eigenvalue (which may be negative). That is, for 
the eigenvalues Ai, A2, . . . , A„ of N, 

1 = Ai > A2 > . . . > A„ > -1 and Ai > |A„|. (7.2) 

It therefore follows that vi = w/||w||, i.e., vi[z] = ^yd{i)/2m = y/Wi. 
For a proof of the following proposition, see, for example j58] . 

Proposition 5. If G is non-bipartite, then Xn > —1 

The following is from [59] (with modifications for consistency of notation). 

n n 

pt ^ j3i/2^<j3-i/2 ^ ^ AlDi/VkVk^D-1/2 = Q + ^ AtD^/ 

k=l k=2 

where Qjj = tTj. That is, 

= PL- = + E ^l^kHvkbl (7.3) 
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If G is not bipartite, then |Aa;| < 1 for 2 < k < n and so 

(t) , 
Pi,j ^ '^i as r — oo. 

This proves Theorem [lO} and moreover, demonstrates how the eigenvalues are 
related to the speed of convergence of the walk to the stationary distribution. 



Thus, from equation (7.3), we can see that for any i, j 



|pS-.,l./fA'.. (T.4, 

where A,,, = max{A2, |A„|}, since A2 > A^ for 2 < A; < n. The quantity Ai — A2 = 
1 — A2 is called the spectral gap and the quantity Ai — A* = 1 — A,,, is called 
the absolute spectral gap, and bounding these quantities is a common means of 
bounding mixing time. 



7.2.2 Conductance 

The conductance we refer to in this section and for the rest of this chapter is 
not the same concept as that used in conjunction with electrical network theory, 
where it refers to weight of an edge in a network. It will be suitable for our 
purposes to first state the definition in terms of Markov chains, and then reduce 
it for random walks on graphs. Definitions are given in e.g., [32], [57] and [59] . 

Definition 19 (Conductance). Let Ai he an irreducible, aperiodic Markov 
chain on some state space Vt. Let the stationary distribution of M. be tt with 
tt{x) denoting the stationary probability of x G Q. Let P be the transition matrix 
for Ai. For x,y E Vt let Q{x,y) = 7!'{x)P[x,y] and for sets A,B C let 
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xeA,yeB Qi^^y) ■ -^^^ conductance of Ai is the quantity 

$ = min (7.5) 

sen 7r(5^) 

7r{S')<l/2 

where 7r(S') = J2xes ^(^)' ^'^^ 5 = i7 \ S*. 

For an unweighted simple graph G = (V, E) with tt, = m = 



Q(«,j) = 7r(2)P[«,j] = 

Thus 

QiS,S) 



^f{^,J)^E 



E{S : ^)/2m 



rf(^)/2m 

where E{S : S*) denotes the number of edges with one end in S and the other in 
S and d{S) = X]je5'^(0- 

Hence _ 

$ = $(G)= min (7.6) 

where tt{S) = X]je5^(0 = d{S)/2m is the probability of a random walk on G 
being in 5* when it is in the stationary distribution. 



To glean some intuition behind equation (7.6), observe that in the stationary dis- 
tribution, each of the 2m total orientations of the edges has the same probability 
of being transitioned. The quantity E{S : S) counts the number of orientations 
out of S, and d{S) counts the total number of orientations that start in a vertex 
of S. Thus E{S : S)/d{S) gives the probability, in the stationary distribution, 
of moving out of 5 at a given step given that the walk was in S. Or said in 
another way, when in the stationary distribution, it is the frequency of moving 
out of S, divided by the frequency of being in S. Intuitively, therefore, it would 
appear that a higher conductance might imply more rapid mixing of a random 
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walk. Indeed, this is the case, as can be seen from this important and useful 
theorem, which was independently proved by [SU] and 



Theorem 53 ([50j). Let A2 be the second largest eigenvalue of a reversible, 
aperiodic transition matrix P . Then 

$2 

— < 1 - A2 < 2$ (7.7) 



To be able to use Theorem [53] on a graph G, it needs to be non-bipartite so that 
it is aperiodic (see section 3.3, Lemma |8]). Furthermore, to use it in conjunction 
with inequality (7.4), we need to make sure that A* = A2. Both of these problems 
can be solved if we make the random walk lazy. This means replacing the 
transition matrix P with L = |P + |l, where I is the identity matrix. This 
introduces a loop probability of 1/2. It means that asymptotically, the cover 
time becomes precisely twice as large. Introducing this loop probability makes 
the graph non-bipartite, but moreover, making it have probability (at least) 1/2 
ensures that all eigenvalues are non-negative. This is easy to see: If x is an 
eigenvector of P with eigenvalue A, then 



and A + 1 > since A > —1 by the Perron- Frobenius theorem (see ( 7.2[ ) 
Furthermore, since x was an arbitrary eigenvector of P, all the eigenvectors 
of P are eigenvectors of L. This implies P and L have the same eigenvectors 
vi, V2, . . . , Vn, and the eigenvector Vi with eigenvalue Aj under P has eigenvalue 
^ (Aj + 1) under L. 

Thus, for the lazy walk L, A2 > |A„| and so A* = A2. Thus, using inequality 
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(7.7) in conjunction with inequahty (7.4), we have, for any i,j 



Our reason for expressing a probabihty distance bound in terms of conductance 
is that it is sometimes far easier to bound conductance than the eigenvalues of 
a graph. This is the case with the graph space we analyse in this chapter. In 



section 7.3.2 , we shall prove that, the conductance of a random graph with given 
degree sequence is at least 1/100, whp. This will imply that a random walk on 
such a graph is rapidly mixing whp, and this is crucial to the proof of Theorem 



49 The meaning of "rapidly mixing" in this context shall be made precise in 



due course. 



7.3 Random Graphs of a Given Degree Sequence: 
Structural Aspects 

7.3.1 The Configuration Model 

The configuration model is a random process that has proved useful for studying 
graphs with prescribed degree sequences. We assume the necessary condition 
that the sum of the degrees is even, XlILi ~ ^"^^ with each vertex i & V, 
we associate di half-edges, or stubs, which we consider distinguishable. Starting 
with an arbitrary stub from amongst the 2m, we choose another stub in the 
graph uar and pair the two. We repeat this process, taking an arbitrary stub 
and pairing it with another chosen uar from the remaining 2m — 3. We continue 
this way until all stubs have been paired. Since there are an even number of 
stubs in total, the process must terminate successfully. The set of pairings that 
results is called a configuration. A configuration C maps to a graph G{C) on 
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the same vertex set and with each stub pairing considered to constitute an edge 
in G{C). Note that there will be multiple configurations mapping to the same 
graph (i.e., the mapping from configurations to graphs is many-to-one). 

In the above process since every other stub was picked at random with no regard 
for which vertex it was associated with, we may have loops and/or multiple edges 
in the resulting graph. It is quite clear that because the stubs are distinguish- 
able, each possible configuration has the same probability, with the number of 
configurations being 

(2m - 1)!! = (2m - l)(2m - 3)...(1) - 



2'^m! 



(If we think of stubs as vertices themselves, we may consider this process as 
picking a perfect matching on the stubs uniformly at random from all possible 
matchings). For a degree sequence d, we shall write CM(d) for the configuration 
space for d. That is, CM(d) is the set all possible configurations on the 
set of vertices V with degree sequence d, and a uniform distribution on them. 
Thus, each configuration C e CM(d) is picked with probability 

The number of configurations mapping to a particular graph with the degree 
sequence in question is not uniform in general, that is, for graphs G, the sets 
{C e CM(d) : G{C) = G} and {C e CM(d) : G{C) = H} may be of different 
cardinality. However, each simple graph with the prescribed degree sequence d 
corresponds to Il'^^i{di)\ configurations. Thus, conditioning on the outcome of 
the process producing a simple graph, we have a uniform distribution. 

The point of the configuration model is that it tends to be much easier to analyse 
and prove statements with than a direct analysis of simple random graphs of a 
given degree sequence. However this is of little use if statements proved in the 
paradigm of the configuration model cannot be carried over to statements about 
the graph space of interest. We use the following principle. 
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Proposition 6. Suppose that for a configuration space CM(d) on n vertices 
there is a function 

f{n) < Pr(C picked uar from CM(d) is simple) 

and suppose that a statement V{d) proved in the configuration model (and thus 
parameterised on d) is false with some probability at most g{n). If g{n) = 
o{f{n)), then V{d) holds whp when we condition on the configuration C drawn 
from CM(d) being simple (and thus mapping to a simple graph). 

Each simple graph G is mapped to by n"^^((ij)! configurations, but the actual 
probability of a simple graph - that is, the total probability of the subspace of 
configurations which map to simple graphs, is difficult to determine precisely. 
Estimates are given, e.g. in [BU], but these require restrictions on the degree 



sequence. Further details will be given in section 7.4 



It should be noted that [32] gives a similar result, demonstrating that there exists 
a constant that is a lower bound for the conductance. The analysis in turn relies 
on results from 



7.3.2 Conductance: A Constant Lower Bound 

Let d = ((ii, (^2, dn) be a sequence of natural numbers and let G = (V, E) be 
a graph of n vertices chosen uar from the family Q (d) of all simple graphs with 
degree sequence d, i.e., such that di denotes the degree of vertex i. 

We make the following assumptions about the degree sequence: 

(i) X]r=i '^i ~ where m is a natural number. 

(ii) The minimum degree 5 > 3. 

(iii) The average degree 9 = 2m /n < n^, where 0<C<l/3isa constant. 



103 



7.3 Random Graphs of a Given Degree Sequence: Structural Aspects 



We work in the configuration model and make the following further assumption: 

(iv) d is further restricted in such a way that statements which fail with proba- 
bility at most n'^^^^^ in the configuration model hold whp when the model 
is conditioned on mapping to a simple graph (in other words, we can apply 
Proposition [6]) . 



As per the statement of Theorem 49 , in this chapter the cover time is analysed 



for nice sequences. As will be seen in section [7^ a nice sequence d has the prop- 



erty that Pr(C G CM(d) is simple) > e~°^^"^"'\ implying that if a statement is 
demonstrated to fail in the configuration model with probability at most n~^^^\ 
then by Proposition [6| it holds whp when we condition for simple graphs. 



The statements in the proof of Theorem 54 do indeed fail with probability at 
most n~^^^\ and so for the purposes of Theorem 49, the proof of Theorem 54 
is valid. Nevertheless, some of the conditions of nice sequences are specified for 
the sake of the analysis of cover time rather than conductance, and so we leave 



the details to section 7.4 and present a proof for Theorem 54 that depends on 



the more general assumptions (i)-(iv). 

Theorem 54. Subject to assumptions (i)-(iv), for a graph G G ^(d), > 
1/100 whp. 

Before we proceed with the proof, we note an immediate corollary: 



Corollary 55. G G ^(d) is connected whp. 

Proof For a set S <^V let d{S) = J2v£S '^('^) ^ configuration G, let 

Ec{Vi : V2) denote the number of edges with one end in Vi and the other in V2 
in G. Let £c{S) = Ec{S : S)/d{S). Let 7r(^) = d{S)/2m. 

We work in the configuration model. Our general approach is to show that when 
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a configuration C G CM(d) is picked, the set 

= {SCV: 7r{S) < 1/2, £ciS) < 1/100} 

is empty with probabihty at least 1 — n~^^^\ Having done this, the theorem 
follows by condition (iv). 

Throughout we make use of the following result of Stirling (see, e.g., [W]): 

n\ = V2^ (^^y e^" (7.9) 

where 

^ < A„ < (7.10) 



12n + 1 12n 



(7.10) implies 1 < e'^" < 1.1 and that e^" = 1 + 0(l/n). For notational conve- 



nience, we shall omit the correcting factor. 



A useful application of (7.9) is for fractions of the form {2k)\/k\ whence we get 



Let X = |#(C)| when C E CM(d) is picked. Let /3 = 99/100, e = 1 - (3, 

my- 



Ti2K) = IP and 



where [/3(i(S')'|* is the smallest even integer greater than or equal to Pd{S). For 
notational convenience, we omit the ceiling and * symbols and note that doing so 
can only incur (small) constant correcting factors that will not affect the results. 

'H(S') is an upper bound on the probability that a particular set of vertices S 
will have SciS) < 0.01 when C is picked. 
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By linearity of expectation 



(7.12) 



scv, 

7r{S)<l/2 



Letting d = d{S), the RHS of (7.11) can be expanded thus 



' d \ T{Pd)T{2m - (3d) 
Jd) J^(2m) 



id)\ 



{I3d)\ {2m-(3d)\ ml 



{/3d)\{d - (3d)\ {l3d/2)\ (m - /3d/2)\ (2m)! 
d\ (2m - I3d) 



{ed)\{l3d/2)\ {m-(3d/2)\ (2m) 



(7.13) 



By Stirling: 



and 



d\ 



^J2^d^e 



{edWdl2)\ 



v/2^(e9)^^e-^^y^(/39/2)/39/2e-/39/2 
1 1 



(2m -/39)! 
(m-/3(9/2)! 
m! 



^ |^4m - 2/3(9 



m-/39/2 



(2m)! a/2 V4m 



(7.14) 



(7.15) 
(7.16) 



We substitute (7.14), (7.15) (7.16) and into (7.13); observe there are seven fac- 



torial terms and we absorb the constant factor corrections from Stirling's ap- 
proximation and the dropping of the ceiling and * symbols into a constant K^. 
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So 



n{S) < K 



-Pd/2 



gd 



Am - 2/38 



K 1 

7^ (e9)^9(/39/2)/59/2 {Am - 2/38^9/2 

1 9 



m-l3a/2 



\Am/ 



K 
K 



d 



1 



{edy{^d/2y/^ {Am-2f3dy/ 

^ '2m- pd 



Am - 2(3d 
Am 

2m - f3d 
2m 



d 



€'13^/^ \2m - (3d 



P/2 



2m 



where ii' is a constant. 



(7.17) 



Call a set S small if d{S) < {9ny^^, otherwise call it large. We handle the cases 
of small and large sets separately, and let the random variables Y and Z count 
for them respectively, i.e., X = Y + Z. We show that E[Y] < n^^^^^ and E[Z] < 
n-^(i) and therefore, by Markov's inequality, Pr(X > 0) < E[F]+E[Z] < n"^(^). 



Small Sets 



We bound the part of the sum of (7.12) for small sets, and we partition it into 
sets of equal size (in the number of vertices). Since d{S) < {9ny^^, \S\ = o{n). 



E[r] < ^(^) 

d(S)<(en)V4 

where Si is the set of size i for which 'H(S') is greatest over all sets 5* over the 
range of the sum with IS"! = i. 

Let S' with |5"| = s and d' = d{S') be the set over the range of the sum for 



= E E «(s)<e(")«(s-) 

j=i 5cy, i=i ^ ' 

diS)<i9ny/'^, 
\S\=i 
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which (^)'H{Si) is greatest. Then 

o(n) 



Using (7.17), 



E[F] < o(^^)^ — 1 



d' 



e'Pf^/^ \2m- (3d' 



o{n) (— 1 



6^(5^1^ \2m/d' - (3 



(3/2 



13/2 



n d' 



-\ d' 



2m - I3d' 
2m 



(7.18) 



2m = On and d' < (OnY^^ so 2m/ d' > (On)^^^, hence, 



1 1 

< 



< 



1.1 



2M/d' -(3 - {Onfl^ - (3 - {dnf/^ 
for large enough n. Hence, 



1 



£^^/3/2 \2m/d' -[3 
Letting p = d' /s, 



13/2 



< 



1.1 



£2^//3/3(^n)3/4 



/3/2 



< 



1.3 



f3/2 



E[r] < o{n) 



. 3 ^-1 S 

en / 2 \ * 2 



s \9n 



< 



/ 2 \ 3/3p/8' 



< [en 



l/s+l-3,3p/8 



(7.19) 
(7.20) 



where (7.20) follows from (7.19) because 9 > 2. 



Now p > 3 so 1/s + 1 - 3;9p/8 < -0.01375 for s > 10. For sets of size s < 10, 
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we can do away with the o(n) term that muhiphes the sum and replace it by 
a constant. This means that the exponent of n is 1 — 3/3p/8 < —0.11. Thus, 
E[Y] < n-^(i). 



Large sets 

We now consider subsets 5" for which d{S) > {9nY^^. Let d{S) = p{S)cn = 
a{S)9n where p{S) = d{S)/\S\ and < a{S) < \. Let d = d{S), a = a{S), and 
note K/y/d < 1 for large enough n. Hence, by (7.17), 



nis) < 



d 



\2m-(3d 
1 ( aOn 



13/2 



2m - I3d 



2m 



-1 adn 



On - (3a9n\ ^ 



On 



1 



a 



P/2 



-1 aOn 



(a/3)"^(l-a/3)^-°^ 



2a 



(1 

= /(^)- 



(7.21) 



We split the proof of the large sets into two parts: Those sets for which a < 1/6 
and those for which 1/6 < a <l/2. 

a<l/6 

Let S'^ e Sc = {S C V : \S\ = cn} be such that f{S'J > f{S) for any S G Sc. 
For a constant < c < 1, define the random variable Zc = "^seSc (^^^^ 
indicator random variable l^(c){S) = 1 if and only if 5 G #(C)). 

Then 
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Applying Stirling's approximation to (^) we have 



n \ ni 



en) {cn)\{n — cn)\ 



V27rcn(cn)'="e-'="A/27r(l - c)n((l - c)n)(i-'^)'*e-(i-^)" 



K2 ( 1 



n 



where is some constant (which we shall assume absorbs the correcting factors 
e'^" in the Stirling approximation). 



Hence 



K2 f{al3)"(^{l-al3y-''(^^^ 



l-c 



Vc(l-c)n VV / c-(l-c) 

Consider the function 

g(x) = - x)^-^ , < a; < 1/2 

gi(0) = 1 and the function is monotonically decreasing with minimum g'(l/2) = 
1/2. 

Since 6 > 3, pen = aOn implies c < q;6'/3. Now a < 1/0 implies q;6'/3 < 1/2, 

therefore g{c) > g{a9/3). 

Hence 

E[Zc] < \ i y \ 



v/c(l-c)n ; (q;^/3)"^/3(1-q;^/3)i-^/3^ 



eRB\ae 
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Consider the function h{x, c) — (cx)^(l — cx) ^ where < c < 1 

ln(/i(a;, c)) = xhi{cx) + {1 — x) ln(l — cx) 

ln(h(x, c)) — X (- — ^ — J = at c = 1 
oc \c I — cx J 

^ln(Mx,c)) = -.^^ + ^A_2j<0 

Therefore h{x, c) < h{x, 1) = g{x). 

So h{a/3e/2,2/e) < g{a/3e/2) < g{ae/3). 
Hence 

Now 



and 



d 1 Al-a/3)^ \ 1 Al-a/3)^ \ / 

9^ I 1 - a/3 1^ (£^/3/5)" y j l-a/Sy (e^/?/^)" y \ (e^/3^)" 

Now 

(l-a/5)i 

— — — j7 — = 1 at a = 

and 

1^(1- a/3) = -/3 
aa 
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Consider 

and \vL{e^l3^f > -/3 when /3 = 0.99, hence ^^=^ < 1 for a > 0. Therefore, 

de \ (e^pP)"^ J 
and since 9 > 6 > 3, we have that 

Because aOn > (^n)^/^ for large sets, we have that 0.73" < 0.73^^")""^* and so 

E[ZJ< ^=£=0.73(5)'. 
a/c(1 - c)n 

1 < c < so c(l - c) > implying , ^ < = 1 + o(l). 

Therefore ^ 

E[Ze] < (l + o(l))ir20.73(^)'. 

Zc represents sets of size cn so multiplying the expression by n for an upper 
bound over all set sizes gives an upper bound of 0(^)0.73^9^) . Furthermore, 
by our assumptions 9 < n^, where 0<C<l/3isa constant, and so |j > n^' 

where C,' is a positive constant. This imphes O(n)0. < n ^ I . 

l/e <a < 1/2 

Let A = {5 C 1/ : 1/9 < a{S) = d{S)/9n < 1/2}. Let 5a e A be such that 
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/(^a) > f{S) for any S e A. Then 



E 



LSeA 



Using (7.21) with a = q:(5'a), 



JXSa 



Let 



A{a) 



(a/3)"^(l-a/3)^-"^\ ^ 



2a 



ln(A(a)) = (a/3) ln((a/3)) + (1 - a/3) ln(l - a/3) - 2a\n{e'l3^) 



d_ 
da 



ln(A(a)) = (3 ln(a/3) - /3 ln(l - a/3) - 2 ln(e"/3^) 



Setting ^ In (A (a)) = gives 



-2e 



a 



1 + £2^//3/32 



0.477 to 3 d.p. 



ln(A(a)) = /3 f- + 1 >0 



da"^ \a 1 — a/3 ^ 

therefore the stationary point is a minimum. Furthermore, A(l/2) < yl(l/3) so 
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A{a) < A{l/e). Hence, 



Let 



Then 



and 



ln(r(^)) = /31n/3 



HT{e)) = In ( ^—^ 1 < 0. 



Then since ^ > 3, we have that T{6) < T(3) and so 

'(/3/3)t(l -/3/3) 1(3-/3)' 



2"/(^a) < 



< 0.8^ 



This concludes the proof that E[Z] < n and Theorem 54 follows. □ 



7.4 Assumptions About Degree Sequence 

We are studying the cover time of a graph G picked uniformly at random uar 
from the set ^(d) of simple graphs with vertex set V = [n] and degree sequence 
d = (di, d2, . . . , dn). We make the following definitions: Let Vj = {i E V : di = 
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j} and let Uj = \Vj\. Let X^ILi ~ and let 9 = 2m/ n be the average degree. 

It seems reasonable to ask how the various entries in the degree sequence affect 
the cover time. In particular, how much does the cover time depend on the 
vertices of low degree, and how much on the average degree of the graph? In 



fact, as in [28], both parameters play a part, as is shown in Theorem 49 



Let < a < 1 and 0<fi;<l/llbe constants, and let dhe a. positive integer. 
Let 7 — 7- oo with n. We suppose the degree sequence d satisfies the following 
conditions: 



(i) Average degree 9 = o{^/logn). 

(ii) Minimum degree 6 > 3. 

(iii) For 5 < i < d, Ui = 0(n''*/°'). 

(iv) nd = an ± o{n). We call d the effective minimum degree. 

(v) Maximum degree A = 0{n'^^'^^^^^^). 

A 

(vi) Upper tail size ^ nj = 0(n''('^-i)/^). 

Immediately on fixing the degree sequence, d, some definitional problems arise, 
as e.g. there may be just a few low degree vertices spread over a wide range. 
To get round this, we define an absolute minimum degree 6, and an effective 
minimum degree d. The effective minimum degree is the first entry in the sorted 
degree sequence which occurs order n times. We fix the minimum degree at 3 
to ensure the graph is connected (whp). Between the minimum degree, and 
the effective minimum degree, we place an upper bound on the number nj of 
vertices of degree i. The bound we choose of rij = 0(n''*/°'), is not as arbitrary 
as it looks. Certainly when k = d/{d — 1), the effective minimum degree drops 
below d, so clearly there is some k, < d/{d — l) which is critical. Finally, we make 
some constraints on the average degree and upper tail of the degree sequence, 
to ensure simple graphs occur with high enough probability in the configuration 
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model. Thus we are left with the above list of conditions. 

We call a degree sequence d which satisfies conditions (i)-(vi) nice, and apply 
the same adjective to G G ^(d). 

Examples of nice sequences/graphs are: Any r- regular graph (where r is con- 
stant); The sequence which has at most n^/^° vertices with degree at most 
n^/^", and the rest are (constant) degree d. This will have average degree 
6 = {1 + o{l))d, but possibly high maximum degree; The sequence which has 
vertices with degree Q{n^^'^^), and the rest are equally divided between 



(constant) degree d and degree B ( -y/log n / log log n) . This will have a high aver- 
age degree Q{^/\ogn/ log log n) as well as a large number of high degree vertices. 



Recall from section 7.3.1 we discussed the set of all configurations with de- 
gree sequence d and with uniform probability, known as the configuration space 
CM(d). That section discussed the probability of a configuration C G CM(d) 
being simple. It was stated that each simple graph G with degree sequence d 
had Il'^^i{di)\ configurations mapping to it, thus, conditioning on simplicity, the 
distribution across simple graphs was uniform. However, the actual probability 
of simplicity was not elaborated on. A result of [60] gives asymptotic values of 
the probability of simplicity, but requires certain constraints on d. 

Let u = '^idi{di — l)/(2m). Assume that A = o{m}^^) (as it will be for nice 
sequences). The probability that C G CM(d) maps to a simple graph is given 
by 

Pr(C simple) ~ e"5-V. (7.22) 

Observe that our assumptions (i)-(vi) that d is nice imply that v = o{\^\ogn). 
Indeed if = y/log n/'-f^ where 7—7-00 then 

^ \i=3 1=76' / ^ 
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All the whp statements in the analysis below of the structure of random graphs 
fail with probability at most g{n) = n~^^^\ whereas Pr(C simple) in (7.22) is 
at least f{n) = e~°^^°^'^\ By Proposition [oj this justifies our use of CM(d) to 
make statements about ^(d). 



7.5 Estimating First Visit Probabilities 
7.5.1 Convergence of the Random Walk 

In this section G denotes a fixed connected graph with n vertices and m edges. 
A random walk VV„ is started from a vertex u. Let W„(t) be the vertex reached 
at step t and let Pi*V) = Pr(>V„(t) = v). 

We assume that the random walk Wu on G is ergodic i.e. G is not bipartite. 
Thus, the random walk Wu has the steady state distribution n, where vr^ = 
d{v)/{2m). Here d{v) is the degree of vertex v. 

Note Pi*V v) is the same as the quantity Pu)j defined in section I 



2.2 



The former 



notation emphasises the point that the starting vertex u becomes irrelevant after 
the mixing time, and Pu\v) becomes close to tTj,. 

7.5.2 Generating Function Formulation 

We use the approach of [20], [22] . 

Let D(t) = maxu^x&v \Pu\x) — vr^-l, and let T be such that, hi t >T 



max 

U,XS:V 



\P^\x)-n,\<n-^ (7.23) 
It follows from e.g. Aldous and Fill ^ that D{s + t) < 2D{s)D{t) and so for 
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k>l, 

max|Pf^)(x)-7r,| < (7.24) 



u,x&v ' " ' ' " ' n^'^ 
Fix two vertices u,v. Letting ht — Pu\v) for notational convenience, let 

oo 

H{z) = J2 htz^ (7-25) 

t=T 

generate ht ioi t >T. 

Next, considering the walk W^, starting at v, let rt = Pr(W^(i) = v) be the 
probability that this walk returns to v at step t — 0,1, .... Let 

oo 

t 



t=0 



generate r^. Our definition of return involves ro — 1. 

For t > T let ft = ft{u-^v) be the probability that the first visit of the walk 
to V in the period [T, T + 1, . . .] occurs at step t. Let 

oo 

t 



F{z) = Y,ftZ 



t=T 



generate ft. Then we have 



H{z) = F{z)R{z). (7.26) 



Finally, for R{z) let 



T-l 



Rt{z)^Y.'^,z^. (7.27) 
i=o 
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7.5.3 First Visit Time Lemma: Single Vertex 

Let 

A=^ (7.28) 

for some sufficiently large constant K. 

The following lemma should be viewed in the context that G is an n vertex 
graph which is part of a sequence of graphs with n growing to infinity. An al- 
most identical lemma was ffist proved in |20j . 

Lemma 56. Suppose that 

(a) For some constant > 0, we have 

min \R't(z)\ > ib. 

\z\<l+\ 

(b) T^TT^ = o(l) and Tn, = n{n'^). 
There exists 



Mu^v) = (1 + OiTn,)) J'r + 0(Tvr„e-^*/2). (7.30) 



where -Rt(I) is from (7.21), such that for all t >T, 

Pv 

[l+PvY^' 
We have the following corollary 

Corollary 57. For t > T let At{v) be the event that Wu does not visit v in 



steps T,T + 1, . . . ,t. Then, under the assumptions of Lemma 56 



1, J- ' Pv) 
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Proof We use Lemma 1561 and 

T>t 

and note that T^vr,, = o(l). □ 



Rather than reproduce the proof of Lemma 56, we instead present a simphfied 
version which conveys some of the core ideas behind it in a more straightforward 
form. This simphfied version gives a bound rather than a bound on Pr{At{v)) 



rather than the precise value given in Corollary 57 

NB We use may use Ry to denote -Rt(I), to emphasis the dependence of -Rr(l) 
upon V. 

7.5.4 First Visit Lemma: Simplification 

For the graph G = {V, E) let = |\^| and m = \E\. 

Visits to Single Vertices. 



Recall from section 2.2 the definition of H[p,a] for a distribution p over states 
of a Markov chain: 

H[p,a] = J^piH[i,a]. 

i 

The hitting time from the stationary distribution, H[7r, ly] = ^^^y 7r„H[t>, ly], 
can be expressed as H[7r,w] = where 



Z^,^ = J2iPi'\^)-^n.), (7.31) 



see e.g. 
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Let P denote the transition matrix for a random walk on G, and, for a walk Wv 
starting from v define 

T-l 

t=o 

Thus Rjj is the expected number of returns made by Wv to v during T steps, in 
the graph G. We note that Ry > 1, as Pv'^\v) = 1. 

In subsequent discussions, we will assume T is a mixing time satisfying 

l^r(^)-^.|<4, (7.33) 



for any vertices u and x in G. Let D{t) = max^^. \Pu\x) — 7r^|. As tt^, > 1/n 



for any vertex of a simple graph, (7.33) implies that D{T) < tTx for all x gV. 



Lemma 58. Let T be a mixing time of a random walk W„ on G satisfying 



(7.33). Let vertex v G V be such that T ■ vr^ = o(l), and vr^ < 1/2, then 



U[7r,v] = (1 + 0(1))^^^. (7.34) 

7r„ 



Proof 

Let D{t) = max^^a; \Pu\x) — 7r^|. It follows from e.g. [3] that D{s + t) < 
2D(s)D{t). Hence, since maxjj^^ \Pu^\x) — t[,j.\ < vr^,, then for each k > 1, 
max„,a; \p!i''^\x) - tTxI < (27r^,)^. Thus 



t=o 

< J](Pi*)(^)-vr.)+T5^(27r, 



t<T k>l 

Rv{T) - T -TTv + OiT -TTv) (7.35) 
i?,(T)(l + o(l)). (7.36) 
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The last inequality follows because Rv{T) > 1. 



□ 



Let At{v) denote the event that >V„ does not visit v in steps 0, ...,t. We next 
derive a crude upper bound for Pr(Aj(f)) in terms of H[7r,t>]. 



Lemma 59. LetT satisfy (7.33), then 



Pr{At{v)) < exp 



'l-o{l))[t/r.\ 



where = T + 2H[7r, v] . 

Proof Let p = p{G, T, u) be the distribution of Wu on G after T, then 



H[p,t;] = (l + o(l))H[7r,t;]. 



Indeed, 



= (l + o(l))^7r^H[' 



W, V 



[l + o{l))U.[n,v] 



(7.37) 



Let hp{v) be the time to hit v starting from p. Then E[/ip(f)] = H[p, t;] so by 



Markov's inequality and using (7.37) 



VT{hp{v) > 2U[n,v]) < 



By restarting the process at >V(0) = u, W(xu), >V(2xy), W([t/r^Jr„) we obtain 

, (1 + o(l))\ 
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□ 



Visits to Vertex Sets 

We extend the results presented above to a subset of vertices. Let 7^ S* C ^. 

From graph G we obtain a (multi)-graph F = by contracting S* to a single 
vertex 7. Note that we retain multiple edges and loops in F5, so that d{S) = ^(7) 
and that m = |-E'(G')| = |-E(F)|. Let vr be the stationary distribution of a random 
walk on F. If v ^ S then tt^, = and also that it s = TT.y. 

Any walk Wv from v ^ S to S in G with internal vertices not in S corresponds 
to an identical walk from f to 7 in Gs, and both walks have exactly the 
same probability. Thus we obtain 

H[7r,S] = H[7r,7] (7.38) 



Let P denote the transition matrix for a random walk on F5, let Tp denote the 
mixing time for F5, and define 

R, = (7-39) 

t=o 

Note that i?^ is the number of returns to 7 in the contracted graph F5, rather 
than a somewhat different notion of the number of returns to S in G. Note also 



that in the summation in (7.39) we use Tr, the mixing time in F, rather than 
the mixing time in G. 

For any vertices u and x in F, we will assume 

|Pf^)(x)-7r,| <n-2. (7.40) 
We will always choose mixing times T satisfying T > Tq, and T > Tr- Because 



123 



7.6 Required Graph Properties 



of this, the distributions pa = p{G,T,u) of Wu, and pr = p{T,T,u) of VVu, 
satisfy 

H[pG,5] = (l + o(l))H[pr,7] = (l + o(l))H[7r,S] = (1 + o(l))H[7r, 7]. 



We also note that provided TpTr^ = o(l) then Lemma 58 and Lemma 59 both 
apply to 7 in r. 



7.6 Required Graph Properties 



Let 

UJ = log log log n 
We use this value throughout our proof. 



(7.41) 



7.6.1 Mixing Time 



We shall define the mixing time to be the smallest T such that (7.23 ) is satisfied 



From inequality (7.8) we have 



|Pi"W-..|<(^) 



1/2 



2\ i 



(7.42) 



As explained in section 7.2.2, we require the walk to be lazy, meaning it only 



makes a move to a neighbour with probability 1/2 at any step. This halves the 



conductance but if ^{G) > 1/100 (which will be the case whp, by Theorem 54) 
we can still set 

r = u;^logn (7.43) 



to satisfy (7.23). The cover time is doubled. Asymptotically the values Ry are 



124 



7.6 Required Graph Properties 



doubled too. Otherwise, it has a neghgiblc effect on the analysis and we will 
ignore this for the rest of the chapter and continue as though there are no lazy 
steps. 

Note that A — 0{rf^ for some constant c < 1 implies T^7r„ = o(l) for any vertex 

V. 

7.6.2 Structural Properties 
Some definitions 

We show some properties held by almost all graphs in ^(d). First we define 
some terms: Let 

^ = Slogan (7.44) 

for some large constant B. 

(i) A vertex v is light if it has degree at most £, otherwise it is heavy. 

(ii) A cycle C or path P is small if it has at most 2u! + 1 vertices, otherwise it 

is large. 

(iii) A small path is light if all of it's vertices are light, otherwise it is heavy. 

(iv) A small cycle is light if it has at most one vertex that is heavy, otherwise 
it is heavy. 

For a vertex v, let G^, be the subgraph induced by the set of vertices within a 
distance cu of v. A vertex v is 

(v) locally tree-like if the subgraph induced by all vertices within distance cu of 

V is a tree, 

(vi) r-regular if it is locally tree-like and each vertex in the tree (possibly with 
the exception of v) has degree r. 
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(vii) r -tree- compliant or r-compliant if there exists a tree subgraph of G^, T„ 
rooted at v in which both of the following are true 

1. In T^,, vertex v has degree dy (that is, all edges incident on v in 
are part of T^). 

2. Every other vertex in Ty has degree r in Ty (where it may be the case 
that r = dy). 



The local graph Ty 

For a light vertex v, define the graph Ty as follows: If u G Gy is heavy, delete 
an edge {u, w) G Gy iff there is no path from w to f (inclusive of w) in Gy that 
is light. r„ is then taken to be the connected component of v in Gy after this 
process has completed. We denote by r° the union of the set of (heavy) vertices 
that have been pruned and the vertices distance u from the root v (when v is 



heavy, we do not need to consider this process, as shall be seen in section 7.9.1 ) 



Structural Properties 

Lemma 60. With high probability, 

(a) No pair of light cycles are connected by a light path. 

(b) No pair of light cycles intersect. 

Proof Before we prove the lemma, we show a useful inequality. For integer 
X > 0, let J-'{2x) = -^f, then 
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T{en-2x) {Qn-2x)\ (f)!2^ 



x-\ n \ /2a;-l \ ^ 



X -1 



% 



= - 2^ + 1 



< 



1=1 

1 



en-2x + l 



(a) The expected number of hght cycle-path-cycle (c-p-c) subgraphs // has the 
following upper bound: 



. ^ I'l' 1" (:) (:) (:)^^^'°«-'— 



(7.45) 



Explanation This is an upper bound on the number of configurations with a 
light c-p-c, divided by the total number of configurations. We choose a vertices 
for one cycle, b vertices for the other and c vertices for the path. We multiply 
by the number of ways of permuting vertices in the two cycles and the path. 
Every vertex but at most one in a cycle is light, hence has at most degree i. 
If there is a heavy vertex, it can have any degree up to A. Hence, each light 
vertex has up to i{i — 1) ways to connect to a neighbour. We therefore multiply 
£^ by itself {{a — 1) + {b — 1) + c) times for the number of ways to connect to 
a neighbour for the two cycles and the path. The remaining (unaccounted for), 
possibly heavy vertex in each cycle can connect in up to A(A — 1) ways to a 
neighbour in the cycle and A — 2 ways to a vertex on the path, hence A^ serves 
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as an upper bound. Since a c-p-c subgraph connects 2(a + b + c + 1) ports, the 
number of ways of connecting the remaining free ports in the configuration is 
counted by the function J^{9n — 2(a + fe + c+ l)). 



Thus, (7.45) is bounded by 



< 



< 



< 



< 



2ui+l 2a;+l 2w+l 

~£G 

a=3 6=3 c=l 
2lu+1 2uj+1 2uj+1 

~£6 ^ ^ 

a=3 b=3 c=l 



n\ I n \ I n 
a) \h) \c 

n\ fn\ fn 



a J \ b J \ c 



2u}+l 2w+l 2lj+1 



albld 



^)J^{9n-2{a + b + c+l)) 



9n - 12w - 7 



a+b+c+l 



en 



- 12a; - 7 ^ ^ ^ 



£4 - 12u; - 



a=3 6=3 
^ 2a;+l 2w+l 2u}+l 

.2u-7 5Z 5Z 5Z 



^ (n - a)! {n - 6)! (ra - c)! - 12a; - 7 



a+6+c 



a=3 6=3 c=l 

A6£6(2.+i)(2a; + l)3 



0n - 12a; - 7 



a+6+c 



£2(a+6+c) 



- 12a; - 7 

/ ^6^12^+2^3 



Thus Pr(yU > 0) = o(n '^), for some constant e > 0, since A = 0{n'^^'^ ^^/'^) 
where k, < 1/11. 

(b) The expected number of intersecting hght cycles has the following upper 
bound: 

E t (:) (:) it) (^..-«- A»^(*i^||^ (7.4^) 



Explanation We have a cycle of size a and another cycle which intersect with 
it. We treat the non-intersecting part of the second cycle as a chord of length b 
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on the a cycle. Since at least one vertex is shared between the cycles, we only 
have 2u as the upper limit of b in the inner sum. We need to pick one or two 
vertices of the a cycle to connect the chord to, thus the factor of (2) . The factor 
of 6! counts (an upper bound of) the number of permutations of the vertices on 
the chord and the powers of i and A play a similar role as per part (a) of the 
lemma. J^{9n — 2(a + 6 + 1)) counts the number of configurations of the rest of 
the ports because 2(a + 6 + 1) ports are occupied for a cycle and chord of size a 
and h respectively. 



Thus, (7.46) is bounded by 



< 



< 



en-8uj-3 ^ ^ (n-a)\(n- b)\ \en - Scu - 3 

a=3 b=l ^ ' ^ ' ^ 

„ 2u)+l 2u) ^ X a+b 

W I ^ ] £2(a+6) 



^ 9n-8uj-3 ^ ^ yen - Sco - 3 

a=3 b=l ^ 

Thus as in part (a), we have Pr(yU > 0) = o{n~'^), for some constant e > 0. □ 



Corollary 61. With high probability, is either a tree or contains a unique 
cycle. 

Proof Clearly, if v is locally tree-like, then r„ will be a tree. If Gy consists 
entirely of light vertices, then it can't have more than one cycle, since all cycles 
would be light and any two cycles would either intersect or be connected by a 
light path. By the lemma, whp no such pair of light cycles exist, and so Ty will 
contain a unique cycle. If Gy contains a heavy vertex h that has two light paths 
to the root v, then h is part of a light cycle. No other heavy vertex h' can have 
more than one light path to v as h' will also be part of a light cycle, and by the 
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lemma, whp no such pair of hght cycles exists. Therefore, any light cycle in 
will be unique (any other cycle will be heavy). Furthermore, there will be 
no more than two light paths from h because of h has at least three light paths 
to V, those three light paths would form an intersecting pair of cycles, which by 
the lemma, is forbidden whp. 

We see, therefore, that every heavy vertex in has one neighbour on a light 
path to f , with the exception of at most one heavy vertex, which can have two 
neighbours with such a property. Since the pruning process deletes edges to 
neighbours of a heavy vertex that do not have a light path to the root, a pruned 
(heavy) vertex will have at most degree 1 in F^,, with at most one vertex (/i), 
having degree 2 (and being part of a unique light cycle). 

□ 

Lemma 62. With high probability, 

(a) The number of vertices v & V that are not d-compliant is at most 7T,4«{<i-i)/ti 

(b) There is no small vertex v, 6 < d{y) < d which is not d-compliant. 

Proof (a) We lower bound the probabihty P that v is d-comphant by the 
success, in the configuration model, of the following random process. 
Process V: 

1. Each stub (half-edge) of v, is paired up with a stub of a vertex that has 
not been chosen thus far in the process. 

2. For 1 < i < CO — 1, and for each vertex w at level i, the first d—1 unpaired 
stubs of w pair with stubs of vertices u of degree d{u) > d that have not 
been chosen thus far in the process. 

The tree created by process V involves Ni-l^d^ E£o"^(^ " 1)' ^ ^(^ " 1)'^ 
pairings. Let a represent the sum of degrees of vertices of degree less than d. 
Thus 
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p > n 



9n — iA — a 



> 



9n-2i + l 

i=l 

On -2Ni + l 
On 

Let X count the number of vertices v that are not d-comphant. Using the 
inequahty 1 — (1 — x)^ < xy for real x, ?/, < x < 1, ?/ > 1, we have 

E[X]<n(l-P)=n 1- 1-^-— < ^ ^ 



On - 9 



Now 



i=0 



for real c ^ 1. Hence under the assumption rii = 0(n'^*/'^), we find that (7.47) 
gives a = 0{n^^'^~^^^'^). 

Given that A = 0{n'^^'^^^^^'^), we therefore have 

E[X] = 0(A=^ + Aa) = d{n'^^^'^'^^/'^) < ci{\og'' n)n=^"('^-^)/'^ 
for some Ci, C2 > 0. Then, 

Pr(X > n^'^('^-i)/'^) = d{n-^^'^-^^/'^). 



(b) In this case we have that the number of small vertices is 0(n''(°'~^^/'^) and so 
E[X] = 0{n'^'^^'^~~^^^'^ /n). Hence, by Markov's inequality, the proposition holds 
with probability at least 1 — n^^^\ □ 
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Lemma 63. With high probability, there are d-regular vertices v E V 

with d{v) = d. 

Proof We consider ci-regular vertices that have a root vertex v of degree d. 
Recall that = \Vd\ = an+o{n) for some constant a > 0. Let N2 = l+d{d—l)^. 
A (i-regular tree of depth u contains N2 vertices. We proceed in a similar manner 



to Lemma [62| and bound the probability P that a vertex v is ci-regular by 
bounding the probability of success of the construction of a d-regular tree in the 
configuration model. 



N2-1 



ID -D r ^ -J 1 ^ TT ^{71^ - i) f ,nd - N2 
F = Fr(a vertex v is d-regular) =11 — — r > I d- 



i=l 



en-2i + l 



en 



N2 



Let M count the number of d-regular vertices, then E[M] = /i = n^P, and 



jji = E[M] > n 



l-o(l) 



(7.48) 



To estimate Var[M], let ly be the indicator that vertex v is d-regular. We have 



(7.49) 



and 



E[J^J^] = Pr(t>,ti; are d-regular, G^iflG^ = 0)+Pr(t>,iu are d-regular, G^flG^ 7^ 0). 



Now 



2N2-2 



Pr{v, w are d-regular, fl = 0) = Y\_ 



i=l 



p^^l^ < P^. (7.50) 



For any vertex f , the number of vertices u such that G^ fl G^ 7^ is bounded 
from above by Ai'2 + dA^|. Using this and (7.50), we can bound (7.49) from above 
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hy IJ + + n{N2 + dNl). 

By the Chebychev Inequahty, for some constant < e < 1, 



/i 



1+2? 



1+2? 



1+2? 



for some constant e. 



The lemma now follows from (7.48). 



□ 



Definition 20. We call a nice graph G good if ^(G) > 1/100, and if the 



statements of lemmas 60 , 62 and 63 hold for G. 



The following is an immediate consequence of Definition 20 and the previously 
proved theorems and lemmas: 



Proposition 7. A graph G E G{d) is good whp. 



As we shall see, G being good allows us to use Lemma 56, and, in particular 



Corollary 57 That is, when G is good, the conditions of the lemma hold (to 



be demonstrated in section 7.8), and we have the tools to calculate the relevant 



quantities, in particular, (a bound on) the mixing time T (see section 7.6.1), and 



Ry, the expected number of returns in the mixing time to a vertex v. The latter 
quantity is calculated in the next section. 



7.7 Expected Number of Returns in the Mixing 
Time 



In this section we calculate bounds for the expected number of returns to a 
vertex v within the mixing time T. First we require the following lemma. 
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Lemma 64. Assume G is good. Let W* denote the walk on Ty starting at v 
with r° made into absorbing states. Let Rl = Yl'^o inhere is the probability 
that W* is at vertex v at time t. There exists a constant ( G (0, 1) such that 

r, = r:+o{c). 

Proof We bound — by using 

Rv-K= (En - r*] + ( E - - E <• (7-51) 

\t=0 / \t=uj+l / t=T+l 

Case t < u. When a particle starting from v lands at some vertex u G r°, this 
is either at at distance u, or m is a heavy vertex at distance less than u from v. 
In the case of u being heavy, by the light cycle condition, there are at most two 
light paths back to v from u of length at most u. All other paths of length at 
most oj go via other heavy vertices. Hence if a particle is at u, with probability 
at most 2/£ it will enter a light path to v. Thus the probability of reaching v in 
time u after having landed on a heavy vertex of r° is at most 0{uj/£) = o(C'^). 

In the alternative case that u is at distance u from v, then for t < u, = rt. 
Thus we can write 

o{C). (7.52) 




Case uj + l<t<T. Using (7.42) with x = u = v and ( = {1 - $72) < 1, 
we have for t > u, that rt = tTv + 0{(^). Since A = 0(n°),a < 1, we have 
TvTt, = oiC^) and so 



T T 



E - <\ = E ^ E + = OiO- (7.53) 



t=UJ + l t=U] + l t=Ul+l 



Case t > T + 1. It remains to estimate Yl't^r+i ^t- Let at be the probability that 
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the walk W* on Ty has not been absorbed by step t. Then < at, and so 

oo oo 

E < E 

t=T+l t=T+l 

Assume first that Ty is a tree. We estimate an upper bound for at as follows: 
Consider an unbiased random walk Xq^\x[''\... starting at \b\ < a < u on 
the finite line (— a, — a + 1, 0, 1, a), with absorbing states —a, a. Xm^ is 
the sum of m independent ±1 random variables. So the central limit theorem 
implies that there exists a constant c > such that 

Pr(x(°l > a or X^^'l < -a) > 1 - e'^^. 

Now for any t and b with \b\ < a, we have 

Pr(|xf)| <a) <Pr(|xf)| < a) 

which is justified with the following game: We have two walks, A and B coupled 
to each other, with A starting at position and B at position b, which, w.l.o.g, 
we shall assume is positive. The walk is a simple random walk which comes to 
a halt when either of the walks hits an absorbing state (that being, —a or a). 
Since they are coupled, B will win iff they drift {a — b) to the right from and 
A will win iff they drift —a to the left from 0. Therefore, given the symmetry of 
the walk, B has a higher chance of winning. 
Thus 

Pv{\xS^\ <a)< Pr(|x(°)| < a)^ 

since after t steps, the worst case position for the walk to be at is the origin, 0. 
Consequently, for any b with |6| < a, 

Pr{\X^%\ > a) > 1 - e-\ (7.54) 
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Hence, for t > 0, 

at = Pr(|X(°)| < a, r = 0, 1, . . . ,t) < e-L*/(2-')J. (7.55) 

Thus 



°° -r/(3cu;2) 



'£at<Y: < rf^TITIW) = 0{uVn^'-^) = OiC) (7.56) 

t=T+l t=T+l 



We now turn to the case where r^, contains a unique cycle C. The place where 



we have used the fact that is a tree is in (7.55) which relies on (7.54). Let x 
be the furthest vertex of C from f in F^,. This is the only possible place where 
the random walk is more likely to get closer to v at the next step. We can see 
this by considering the breadth first construction of r^,. Thus we can compare 
our walk with random walk on [—a, a] where there is a unique value x < a such 
that only at ±x is the walk more likely to move towards the origin and even 
then this probability is at most 2/3. 

Since we have 

Pr(3r < ca' : = x) > 1 - e-^^^ 

We now consider the probability the particle walks from e.g. x to a without 
returning to the cycle: There is 1/3 chance of moving away from the origin (to- 
wards a). Now from standard results, an unbiased random walk in one dimension 
between x and a starting at position x + 1 would have probability l/(a — x) of 
hitting a before hitting x. Since the actual walk will be biased in favour of 
movement toward a, a probability of l/3(a — x) serves as a lower bound on the 
probability of going from x to a without returning to x. 
Thus 

Pr(3r < ca^ : |xf^^„_^| = a) > (1 - e"^/2)/3a. 
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and 

at = Pr(|XW| < a, r = 0, 1, . . . ,t) < (1 - (1 - e-i/2)/3a)L*/(2-^)J < e-*/(20-)^ 

(7.57) 

Thus 

°o °o „-T/(20ca;)3 

E £ E ^ 1 _ = o(.V"«'") = 0(C) 

t=T+l t=T+l 

□ 

Lemma 65. Assume G is good. For a vertex v eV , 

(a) if V is d-regular, then Ry = ^ + O^C"^), 

(b) if V is d-tree- compliant then Ry < + 0(C'^), 

(c) ifdy <i,Ry< = 7^ < 8/3 
for some constant ( G (0, 1). 
Proof We use Lemma [64l 

(a) We calculate i?* for a walk W* on an r- regular tree F^, with r° made into 
absorbing states. For a biased random walk on (0, 1, k), starting at vertex 1, 
with absorbing states 0, k, and with transition probabilities at vertices (1, . . . , k — 
1) of g = Pr(move left), p = Pr(move right); then 

Pr(absorption at k) = — ^. (7.58) 

{q/pf-l 

This is the escape probability p - the probability that after the particle moves 
from V to an adjacent vertex, it reaches an absorbing state without having visited 
V again, i?* = 1 + 1/p - 1 = 1/ p. 
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We project W* onto (0, 1, . . . , with p = ^—^ and q = ^ giving 
and part (a) of the lemma follows. 



(b) If V is (i-tree-compliant, we can prune Gy removing edges from each vertex 
(other than v) until v is (i-regular. Treating the edges as having unit resistance, 
this pruning process cannot decrease the effective resistance between v and a 
hypothetical vertex ( that is connected by a zero-resistance edge to each of the 
vertices in r° (and no others). Then by part (a) and Rayleigh's monotonicity 
law part (b) of the lemma follows. (Here we are using the the fact that the 
probability of reaching ( before returning to v is equal to where R is the 
effective resistance between v and (. Rayleigh's Law states that deleting edges 
increases R). 

(c) Using (7.42) and the fact that Tvr^ = o(l), we have J2t=o''"t — X]t=o(^f + 
0(C*)) = 0(1), quickly demonstrating that R^ is bounded from above by a con- 
stant. To get the tighter bound, we observe that for any locally tree-like vertex 
V, Rv < jz^ + 0{C,^) by part (a). If there is a cycle (which will be unique) r^, will 
be tree-like up to some distance L — 1 < u and non-tree-like at level (distance) 
L, then there will be one vertex Vc on the cycle at level L if the cycle is an even 
size, and two vertices Vc, Wc if it is an odd size. The subtree up to level L — 1 will 
be 5-tree-compliant, and so if the particle does reach Uc € {vcWc}, it will halt 
if Uc is an absorbing vertex and move one level down (i.e., further away from 
the root) with probability at least (5 — 2)/5 if Uc is not an absorbing state. If it 
is an even cycle, it will move up the tree (i.e., closer to the root) back to level 
L — 1 with probability at most 2/6, and if it is an odd cycle, it will move up 
with probability at most 1/6 and stay at the same level (moving to the other 
vertex in {vc,Wc}) with probability at most 1/6. Levels below L (should they 
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exist) continue to be tree like and so maintain the probabihty distribution of 
going down with probabihty at least {S — 1)/S and going up with probability at 
most 1/(5. 

Given that there exists a unique cycle in Fy, Rl can be no larger than that for 
the case where Ty is such that all non-leaf vertices have degree 6 and the ab- 
sorbing states r° are all at distance u from the root. The only other variable is 
the location of the cycle. 

We shall resort again to electrical network theory to bound i?*. We assign a 
unit flow to with the root vertex v as the source and a hypothetical sink 
vertex that is connected by a zero-resistance edge to each of the vertices in r° 
(and no others). We treat each edge in as having unit resistance. When 
is a r-rcgular tree (apart from leaves), we split the flow coming into a vertex 
evenly across the r — 1 neighbours down the next level. Denote this energy by 
PT{r). Note Pt{t) > Pt{s) for r < s. To calculate bounds for the minimal 
energy when r^, contains a cycle, we assign the same flow as for a tree, except 
at the cycle. Consider flrst an even-sized cycle. There will be two sources of 
flow into Vc (deflned above), and by symmetry, these will be equal - call them 
/. If 6 = 3, these flows combine at Vc to become a 2/ flow going through one 
edge, then split off into two flows size / at the next vertex. The flow assign- 
ment at subsequent vertices is as per usual, with an even split at each vertex. 
If S > 3, then from Vc, we assign a flow of / to each of two edges into the next 
level down and then continue as normal. For 6 = 3 the energy Pr^, of the sys- 
tem is bound by Pt{S) + (2/)^ + 2p = Pt{S) + Qp and for 5 > 3 the energy is 
bounded by Pt{S) + 2P and since / < jjzfjs, we have a bound of Pt{S) + (^zfpp- 

For the case where the cycle is odd size, we assign the flows on the same basis as 
a tree (as in the case of the even cycle) except that the flow / which comes into 
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each of vertices {vc, Wc} will be directed from each vertex through a single edge 
going to the next level down. Thus for all S > 3, Pr„ < Pt{S) + 2/^, and since 
/ < 1/5 we have Pr„ < Pt{S)+2/6'^. We bound Pt{S) by calculating the effective 
resistance of an infinite 5-regular tree: PriS) < Xli^o s(s-iY ~ if^- Thus 
Pr^ < ~^ p"- This is an upper bound on the effective resistance between the 
root vertex and a hypothetical vertex connected to each vertex in r°. The escape 
probability p of a walk starting at the root is related as p = Res(f)/Pr„ where 
Res(f ) is the parallel resistance of the edges incident with v (or alternatively, 
the reciprocal of the sum of conductances of the edges incident with v). This is 
1/5 when dy = 5. Thus i?* < ^^(^t~2) which is monotonically decreasing and for 
5 = 3 is 8/3. 

□ 



7.8 Lemma Conditions 



We address the conditions of Lemma 56 For condition (b), we note that n < 



m < implies the condition Tvr^ = holds since T is larger than constant 

and T'^TTy = o(l) holds since = (w^logn)^ = o{n'^) for any positive constant 
c, and 7r„ = 0{n^'^) for some positive constant c. It remains to show condition 
(a) holds: 

Lemma 66. Assume G is good. For < 1 + A, there exists a constant > 
such that \Rt{z)\ > ip. 

Proof As before, let r° be the set of absorbing states of F^. We consider a 
walk in F^, that starts at v, and each time it leaves v it is terminated if it doesn't 
return to it within time T. It is also terminated if it reaches an absorbing state 
(states in F°) at any time. Let (3{z) = XlLi A-^* where /3j is the probability 
of a first return to v at time t < T. Let a{z) = 1/(1 — P{z)), and write 
<^{z) = ^i^o*^*-^*' thsX at is the probability that the walk is at v at time t. 



Note that at is not quite the same as in Lemma 65 , which would be generated 
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as 1/(1 — Ylt^i ^t)' where 6* represents the probabihty of a first return of W* at 
time t. Observe that at < < r^. 

We shall prove below that the radius of convergence of a{z) is at least l+i7(l/u;^). 



We can write 



Rt{z) = a{z) + Q{z 
1 



Qiz), (7.60) 



where Q{z) = Qi{z) + Q2iz), and 

T 

Qi{z) = ^{rt-at)z^ 



t=o 

oo 



Q2{Z) = Yl 



atz\ 



t=T+l 

We note that Q(0) = 0, a(0) = 1 and /3(0) = 0. 



We claim that the expression (7.60) is well defined for < 1 + A. We will show 
below that 

mz)\=o{l) (7.61) 

for |z| < 1 + 2A and thus the radius of convergence of Q2{.z) (and hence a{z)) is 
greater than 1 + A. This will imply that \P{z)\ < 1 for < 1 + A. For suppose 
there exists zq such that |/3(2;o)| > 1. Then /3(|-2o|) > |/3(-2o)| > 1 and we can 
assume (by scaling) that /3(|-Zo|) = 1- We have /3(0) < 1 and so we can assume 



that /3(|-2|) < 1 for < < l^ol- But as p approaches 1 from below, (7.60) is 
valid for z = p\zq\ and then |i?r(p|2;o|)| — ?■ oo, contradiction. 

Recall that A = 1/KT. Clearly /3(1) < 1 and so for l^l < 1 + A 
(3{\z\) < + A) < /3(1)(1 + A)^ < e^/^. 
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Using |1/(1 - I3{z))\ > 1/(1 + I3{\z\)) we obtain 

1 1 



\Qi^)\ > 



(7.62) 



l + P{\z\) - l + eVi^ 

We now prove that 1(5(^)1 = o(l) for |z| < 1 + A and the lemma will follow. 

Turning our attention first to Qi{z), the following results hold both for a tree, 
and containing a cycle. 



|Qi(^)| < (1 + Af |gi(l)| < e^/^ 5^ In - a,| 

t=o 



(7.63) 



Now using part of the proof of lemma 65, we see that X]t=o I'"* ~ ~ 
hence |<5i(;z)| = o(l). 



We now turn our attention to Q2{z). As in lemma 65, let at be the probability 



that a walk on r^, (with r° absorbing states) starting at v has not been absorbed 
at step t. Then cr^ > aj, so 



\Q2{z)\ < at\z\\ 



t=T+l 



If Gy is a tree we can use (7.55) to see that the radius of convergence of Q2{z) 
is at least e^/^^^^^l As a < e^/^^ca^) ^ i + 2A and for < 1 + 2A, 



\Q2iz)\ < J2 e2"*-L*/(2-^)J = 0(1). 



t=T+l 



This lower bounds the radius of convergence of a{z), proves (7.61) and then 



(7.63), (7.61) and (7.62) complete the proof of the case where r„ is a tree. 
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In the case that contains a unique cycle, we can use (7.57) to see that the 



radius of convergence of Q2{.z) is at least 1 + > 1 + 2 A, assuming that K 



(defined in (7.28)) is sufficiently large. Finally, if ^ G Ca then 



mz)\ < J2 e 



(A-l/(20ca;3))t 



< 



t=T+l 



-T/(25cw3) 
p-l/(25ctj3) 



□ 



7.9 Cover Times 



We now prove the main theorem of the chapter. Theorem 49, which we restate 
here, 



Theorem 49 (restated) Let G he chosen uar from G{d), where d is nice. Then 
whp, 

COV[G]~^^nlogn, (7.64) 
where d is the effective minimum degree and 9 is the average degree. 
In the following, we assume G is good. 

7.9.1 Upper Bound on Cover Time 



As per the definition in section |2.3[ c„ is the time taken by the random walk W^j 
starting from vertex u to visit every vertex of a connected graph G. Let Ut be 
the number of vertices of G which have not been visited by W„ by step t. We 
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note the following: 



COV^iG] = E[cJ = 5^Pr(c„>t), 



(7.65) 



i>0 



Pr(c„ >t)= Pr{cu >t-l) = Pr{Ut-i > 0) < min{l, E[Ut-i]}. (7.66) 



Recall from (7.30) that As{v) is the event that vertex v has not been visited by 



time s. It follows from (7.65), (7.66) that 



COy^[G]<t + l + J2E[U,]=t + l + J2J2Pr{As{v)). 



(7.67) 



s>t 



V s>t 



Let to = (^rii) ^logn and ti = (1 + e) to; were e = o(l) is sufficiently large that 
all inequalities claimed below hold. We assume that the high probability claims 



of Sections 7.4, 7.7 hold. 



Recall from (7.29) that pv = (1 + 0{Tn^,))dv/ {OnRy) and that Corollary 57 



gives 



the probability that >V„ has not visited v during [T, t] as 



Pv{At{v)) 



:i + 0(T7r,)) , ,-A^/2^ 



+ 0(T'7r,e- 



(l + o(l))(l+p. 



Thus 



Y.{i+o{i)){i + 



Pv 



(l + o(l))(l+p,)-*^5^(l+p,) 



-(t-ii) 



t>tl 



t>ti 



We consider the following subsets of V: 
(ii) Vb = Ud<i{'^ G : f is rf-compliant}. 
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[iii)Vc' = Ud<j<^{^ E Vi : V is not (i-compliant}. 



Case (i): 6 < < d. 

For these vertices, is (i-compliant by Lemma 62 Consider vertices in Vi, 
i < d. By Lemma 65 (b), -Rt, < (1 + o(l))^ so for v E Vi (7.68) is bounded by 



Recall that \Vi\ = 0{n'''/'^) where k < 1. Thus 



Case (ii): d < dy, V is d- compliant. 



For V E Vb (7.68) is bounded by 0{6)n Therefore 



v&Vb t>ti 



vGVb 



Case (Hi): d < d^ < i, V is not d- compliant. 



For vertices v E Vc (7.68) is bounded by 0{6n)n 



T^<i-2 where TZ < 8/3. 



By Lemma 62, IV^I < n^'^'^'^ ^^Z'', where < k < 1/11 hence 



■tPv 



inm 



v&Vc 



smce 



(d— 1) , ^ , 1 d — 1 , , x/'4k , ^ , 3 \ d — 1 

4«L_i_(i + ew)-— < (d-i) (- - (1 + ew)^j^l < 
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Case (iv): I < dy 

Clearly, Rv < T = w^\ogn thus, for vertices f G Vd (7.68) is bounded by 



o{ti 



v£Vn t>ti 



vGVd 



In each of the cases above, the term J2v J2s>t^^i-^s{v)) = o(ti) and thus, from 
(7.67), COV„[G] < (1 + o(l))ti as required. This completes the proof of the 
upper bound on cover time of G. □ 



7.9.2 Lower Bound on Cover Time 

Let ^2 = (1 ~ e)^0) were e = o(l) is sufficiently large that all inequalities claimed 
below hold. For vertex u of degree d, we exhibit a set of vertices S such that at 
time t2 the probability the set S is covered by the walk Wu tends to zero. Hence 
Cu > t2, whp which implies that COV[G] >to — o(to)- 

We construct S as follows. Let Sd be the set of (i-regular vertices of degree d. 
Lemma 63 tells us that l^f^l = n^'^^^K Let 5* be a maximal subset of Sd such that 
the distance between any two elements of S is least co. Thus IS"! = ^l(n^~°^^y d^). 

Let S{t) denote the subset of 5* which has not been visited by W„ after step t. 
Let V E S, then 

PriAt.iv)) = (l + o(l))(l+p,)-*2+o(n-2) 

= (1 + o(l))e-*2P"(i-o(P")) + o(n-2). 

(The above is justified because (1 + x)^ = e^^^^sCi+^O = (.y{x-4+f--) = (.y^{i-o{x)) 
for real x,y with |x| < 1). 
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Hence, 



E{\S{t,)\) > (l + o(l))|5|e- 



n 



e/2-o(l) 



'{l-e)toPv 



— 7- OO. 



(7.69) 



Let Y^^t be the indicator for the event At{v). Let Z = {v, w} C S. We will show 
(below) that that for v,w & S 



t2 



(7.70) 



where pz ^ Pv + Pw Thus 



E(n,t,>^.,,tJ = (l + o(n-2))E(i;,ijE(r^,iJ 



which implies 



Ei\Sit,mSit,)\ - 1)) ~ E(|S(t2)|)(E(|5(t2)|) - 1). 



(7.71) 



It follows from (7.69) and (7.71), that 



Pr{S{t2) 7^ 0) > 



n\s{h)\y 
E(|5(t2)p; 



E(\S{t2)\(\S(t2)hl)) 
E(|5(t2)|)2 



+ Ei\Sih)\)-^ 



1-0(1). 



Proof of (7.70). Let G be obtained from G by merging t;, w into a single node 



Z. This node has degree 2d. 



We apply Lemma 56 to G. That tt^ = |^ is clear. Furthermore, the vertex Z 



65 



a IS 



is tree-like up to distance u in G. The derivation of -R^ as in Lemma 
valid. The fact that the root vertex of the corresponding infinite tree has degree 
2d does not affect the calculation of R*^. 
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Thus 



1 + 0(T7r,) 1 + 0(T7rJ , , 

= -\- 0\TI 

= Pr(Ai,(t;))Pr(Ai,H) + o(n-2) 



Now 

P^(Ai,(Z)) = 5^Pf)(x)P^(>V.(t-T)^Z,r<t<t2) 

= 5^(^ + 0(n-3)W(>v,(t_T)^Z,T<t<t2 



^ (Pf ) + 0(n-=^)) Pr(>V.(t - T) ^ t;, < t < ^?)72) 
Pr(At,(t;) A At,H) + 0(n-3) 



Equation (7.72) follows because there is a natural measure-preserving mapping 
from the set of walks in G which start at u and do not visit f or w, to the 
corresponding set of walks in G which do not visit Z. 

We have shown that 

Pr{At,iv) A At,{w)) = (1 + oin-^))Pr{At,{v))Pr{At,{w)) + 0(71-^) 

□ 
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Chapter 8 

Weighted Random Walks 



Chapters [6] and [7] studied random walks on different graph structures, but in 
both cases the wallcs were simple, meaning that each step an edge is chosen 
uar from the edges incident on the current vertex, and the walk transitions 
that edge. However, we may also define random walks on graphs G = {V,E,c) 
with differently-weighted edges, where an edge e = {u, v) has some non-negative 
weight, or conductance, c(e), and the probability of transitioning e from u is 
given by c{e)/c{u) where c{u) = J2e={uv)eE^i^)- Simple random walks are a 
special case where c is a constant function. Chapters [2} [3] and |4] discuss the 
details, and we refer the reader there. We shall call random walks on weighted 
graphs weighted random walks. 

In this chapter, to avoid confusion between conductance c(e) of an edge, and 
conductance $(A^) of a Markov chain Ai as defined in Definition 19, we instead 
refer to edge conductance as weight and use w rather than c. We will still use 
resistance r(e) and effective resistance R{u,v) as there is no ambiguity. 

As with chapters [6] and [7], we deal only with simple, connected, undirected 
graphs. 
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8.1 Weighted Random Walks: Hitting Time and 
Cover Time 

For simple random walks, asymptotically tight bounds for cover time were given 
by [36] and [37]: 

(1 + o(l))nlogn < COV[G] < (1 + oil))^n\ 
and these lower and upper bounds are met by the complete graph and the lollipop 



graph respectively. See section 5.3.1 



However, the hitting and cover times for weighted random walks have not been 
studied in as much depth, and it is not difficult to formulate many open questions 
on their behaviour. In particular, what bounds exist for hitting times and cover 
time? This was addressed in part by [17] and [H]. The investigation is framed 
as follows. For a graph G let ViG) denote the set of all transition probability 
matrices for G, that is, stochastic matrices (i.e., non-negative entries with each 
row summing to 1), that respect the graph structure. The latter means that 
Pu,v = if (M,f) ^ E{G). For P e V{G) let Hg{P) denote the maximum 
hitting time in G with transition matrix P, and Cg{P) similarly for cover time. 
Let 

Hg= inf Hg{P) and Gg = inf Gg{P). 

PeV{G) PeV(G) 

Note that if for a graph G one knows a spanning tree Tg, a transition matrix P 
can be constructed that is a simple random walk on T^, and ignores all other 



edges of G. As per section 5.2, this implies a O(n^) upper bound on Hq and 
Cg. 

It is asked 
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(1) What are general upper bounds on 



H(n) = max Hq and C(n) = max Cq- 

G:V{G)=n G:V{G)=n 



(2) What is the minimum local topological information on G that is always 
sufficient to construct a transition matrix that achieves each of H{n) and 
C(n)? 



In [IS], it is shown that for a path graph P„, any transition matrix will have 
f2(n^) maximum hitting time (and therefore, cover time). This, in conjunction 
with the spanning tree argument, implies Q{n'^) for both H{n) and C{n). 

The second question is addressed with a particular weighting scheme that gives 
O(n^) maximum hitting time for any graph. Note, Matthews' upper bound tech- 



nique (see section 5.2.3) applies to weighted walks as well, implying a 0{n logn) 
bound on cover time. The weighting scheme, which we call the Ikeda scheme is 
as follows: 



iivE N{u) 



P„_^ = <j E^eivMl/V'^H (8.1) 

otherwise 

where d{v) is the degree of v and N{v) is the neighbour set of v. 

A note on application In the algorithmic context of graph exploration, sim- 
ple random walks are generally considered to have the benefit of not requiring 
information beyond what is needed to choose the next edge uar. Generally, this 
implies that a token making the walk can be assumed to know the degree of 
the vertex it is currently on, but no more information about the structure of 
the graph. In the Ikeda scheme, information required in addition to the vertex 
degree, is the degrees of neighbouring vertices. Other local information - such 
as connectivity, is not required. 



151 



8.2 Minimum Degree Weighting Scheme 



8.2 Minimum Degree Weighting Scheme 

We present an alternative weighting to the Ikeda scheme: Simply, for G = {V, E), 
assign each edge {u,v) weight w{u,v) = 1 / mm{d{u) , d{v)} (equivalently, each 
edge is assigned resistance r{u,v) = m.m{d{u) , d{v)}) . This weighting scheme 
defines the transition matrix of a weighted random walk. We may, as a matter 
of convenience, say that w{u,v) = if {u,v) ^ E in calculations of transition 
probabilities. 

We call this scheme the minimum degree (or min-deg) scheme. It places no 
greater information requirement in applications than the Ikeda scheme; it also 
only uses limited, local knowledge. In fact, it is easily checked that equation 



(8.1) is equivalent to weighting the edges w{u,v) = 1/ ^d{u)d{v). 



We have the following (in)equalities. 



w[u,v) < —— + — - < 2w[u,v). 
d[u) d{v) 

w{u) = w{u, v). 

veN{u) 

veN{u) v£N{u) 

E rf^ + rf^ = E^(«)^rf(-i=- 



1.2) 
13) 

U) 
15) 

16) 



Equation (8.3) is directly from the definition in section 2.1.1, equation (2.4). 
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8.2 Minimum Degree Weighting Scheme 



From equation ( 2.5[ ) (and noting that we are using w to stand in for c) 



w{G) = ^w{u) = 2 ^ w{u,v), 



SO summing over all edges and using (8.2) 



w{G)=2 J2 y^M<2 J2 dk^dL)-^ ^ w{u,v) = 2w{G) 



17) 



Then using (8.6) in the middle part of (8.7), 



n < w{G) < 2n. (8.8) 
Thus, the min-deg weighting scheme has the following transition probabilities: 



l/min{d{u),d{v)} ^ ^ jy/ \ 

otherwise 



8.2.1 Hitting Time 

The proof of O(n^) hitting time in |1S] applies the following, generally useful 
lemma. 

Lemma 67. For any connected graph G and any pair of vertices u,v E V{G), 
let p = (xo, Xi, . . . , Xi) where Xq = u and xi = v be a shortest path between u and 
V. Then 



i=0 



where d{x) is the degree of vertex x and n = \ V{G)\. 
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Proof For any < z < z + 2 < j < £, it is the case that {xi, Xj) ^ E{G) and 
N{xi) n N{xj) = 0. If not, then there would be a shorter path. Therefore, each 
vertex in G can be connected to at most 3 vertices on p, and the lemma follows. 

□ 

We have the following 



Theorem 68. For a graph G under the min-deg weighting scheme, H[m, v] < Qn^ 
for any pair of vertices m, f G V{G). 

Proof Let p = {xq, Xi, . . . , xi) where Xq = u and X£ = v he a shortest path 
between u and v. 



H[u,v] < 



i=0 

i-i 



< 



J2COM[xi,x,+i] 



i=0 



w{G)'^R{xi,Xi+i) 



.10) 



(8.111 



(8.12) 



where R{x,y) is the effective resistance between vertices x and y and (8.12) 



follows (8.12) by Theorem 22 
Now 



R{x, y) < r(x, y) = mm{d{x), d{y)} 



and so 



i-l £-1 l-\ 

^i?(xj,Xi+i) < ^min{d(a;i),rf(xi+i)} < ^^(xi) < 3n, 



j=0 



i=0 



j=0 



where the last inequality follows by Lemma 67 



By (8.8) we have w{G) < 2n, and the theorem follows. 



□ 
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By Matthews' technique (Theorem 30), 



Corollary 69. COV[G] = 0{n^\ogn). 



does not use electrical network theory; the proof that the Ikeda weighting 
scheme results in an O(n^) upper bound on hitting time is different. Further- 
more, [IH] conjectures that this weighting scheme in fact gives an O(n^) upper 
bound on cover time. We do the same for our weighting scheme: 



Conjecture 2. The minimum degree weighting scheme has 0{n'^) cover time on 
all graphs G. 

To our knowledge, no weighting scheme has been shown to meet an 0{n^) on all 
graphs G (where, as stated in the introduction to the chapter, G is assumed to 
be simple, connected and undirected). 



8.3 Random Graphs of a Given Degree Sequence 

In chapter [7] we determined the following theorem: 



Theorem |49| (restated) Let G he chosen uar from G{d), where d is nice. Then 
whp, 

COV[G]~^^nlogn, (8.13) 

where d is the effective minimum degree and 9 is the average degree. 



Recall uj = log log log n as defined in equation (7.41). In this section we prove 
the following: 

Theorem 70. Let G be chosen uar from Q{d), where d is nice with the following 
extra restriction: A < . Weight the edges of G with the min-deg weighting 
scheme, that is, for an edge {u,v), assign it weight w{u,v) = 1/ min{d{u),d{v)}. 
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Denote the resulting graph Gw Then whp, 



COV[GJ^ < (1 + 0(1))^ Snlogn. (8.14) 

— I 



where 5 is the minimum degree. 



Note that the degree sequence assumptions of section 7.4 allow for the ratio 9/d 
to be unbounded. Hence, the ratio of the min-deg cover time to the simple cover 
time, that is, the speed up, can be unbounded. 

Our approach in this section is similar to that in the chapter [7| and we borrow 
heavily from there. We sometimes use d^ for d{v). 



8.3.1 Conductance 



We restate the definition of conductance from section 7.2.2: Q{u,v) = 'k{u)Pu^vi 

and for a set 5 C V, 7r{S) = X]«e5 ^(^)' ^' = ^\'^ ^^"^ Q(^^ = '^ues,ves' Qi"^^ 
The conductance ^{Gw) of G^ is defined as 

<I>(G^) = mm 



t(5)<1/2 Tt{S) 

We show 

Lemma 71. For G E ^(d) where d is nice, $(6*^) > 1/(100 A) whp. 
Proof 

Since w{u) = J2v&v (where w{u,v) = if {u,v) ^ E) and 7t(u) = J^^-*^ 

and Pu,v = we have Q{u,v) = ■it{u)Pu,v = ^^Iff^ and 



Q{S,S') = —^ w{u,v) 
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8.3 Random Graphs of a Given Degree Sequence 



Since 



we have 



TT 



wiu) 



mm 



7r(5)<l/2 E„65^H 



(8.15) 



In section |7.3.2| we showed that whp the conductance is bounded below by 
e = 1/100: For a graph G picked uar from ^(d), subject to d being nice,, whp 



(8.16) 



for any set S such that vr(S') < 1/2 (recall £'(5' : S") is the set of edges with one 
end in S and the other in S, and ^(5*) = X]i,es' '^('^))- This implied that for an 



unweighted (or uniformly weighted) graph, $(G) > e, equation (8.15) becomes 

\E{S:S)\ 



$(G) 



mm 



7r(5)<l/2 d{S) 



If A is the maximum degree in d, then w{e) > 1/A for any edge e. Therefore, 
$(^0,) > <I>(G)/A > 1/(100A). □ 



We set 



Then, 



T = u logn. 



(8.17) 



Corollary 72. Given the value of T from (8.17) and the restriction on A, by 



section 7.6.1, equation (7.42), t>T satisfies (7.23) 
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Proof Section 7.6.1, equation (7.42) restated: 

1/2 



\P^\^)-^.\ < (- 



1 



2\ * 



< A^/^l^l- 

< w^/^ exp 



ij^ log n 



where is a constant. 



□ 



8.3.2 The Stationary Distribution 



Lemma 73. For a vertex u, 



1 , , diu) 
2n- ^ ^ - n 



Proof -^-^ - ^ 
Corollary 74. TTi{u) = o(l). 



n(u] = Now use ( |8.4[ ) and ( |8.5[ ) with ( |8.8[ ) 



□ 



8.3.3 The Number of Returns in the Mixing Time 

Recall the definitions in section 17.6.21 and the results in section 17.71 We first 



require a version of Lemma 64 for weighted F^. We only require the case that it 
has no cycles. 

Lemma 75. Assume is good. Let W* denote the walk on (weighted) 
starting at v with r° made into absorbing states. Assume further that there are 
no cycles in r°. Let = Xlt^o^t where rj" is the probability that W* is at vertex 
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V at time t. 



'^(v^)^ 



Proof We use the proof of Lemma 64 to prove equation (7.51), reproduced 
here 



T 



ID p* 



t=u)+l / t=T+l 



.i=0 



Firstly, observe that since r° has no cycles and no heavy vertices (by the restric- 
tion on A), it is a tree with leaves at distance u. 

Case t < u. For t < u, = rt. Thus we can write 



E 

,i=0 



n - r. 



0. 



119) 



Case u + 1 < t < T . We use (7.42) with x = u = v and observe z 



'I - 



^{GwY /2) < (1 - l/K^) for some constant K. Observe Ttt^ = 0{llrf) for 
some constant c > 0, so T-k^ = o(y/coe^^). Hence, 



T T T 

\rt - r*\ 

t=U} + l t=UJ + l t=Ul + l 



(8.20) 



Case t >T + 1. The proof of Lemma 64 for this case up to (7.56) applies, with 



the final equality in (7.56) replaced with 



□ 



Lemma 76. Subject to the extra restrictions on d, for a min-deg weighted good 
G G ^(d) and a vertex v G V{G), 



(a) ifu is locally tree-like then = + 0{y^e'^^'^'^) 
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(b) Ru < shl^ + 0(v^e-^(^)) < 6 + 0(v^e-^(^)). 

Proof (a) Suppose u is locally tree-like, but not necessarily regular. Let 
G{u) be the locally (i.e., out to distance u = log log log n) induced subgraph of 
Gw and let v G G{v). Let p be the probability of movement away from u and q 
the probability toward it. Observe the weighting scheme gives 

q ~ 1/5 ~ 5 
This ratio is the same as an unweighted walk on a 5-regular vertex u. Therefore, 



using Lemma 75 and by the same principles as Lemma 65 



— 2 



(b) Since any cycle in r° is unique, at most two of the edges, (m, fi), {u,V2) out 
of u lead to vertices on a cycle. Let bad be the event event of moving from u to 
some V G {vi,V2} Then 

2 / S 2 / S 

^"■^^""^^ - 2/5 + K-2)(l/rf„) = 2/5+1- 2/du - 



5-2 
5 ' 



The probability of moving from u to some v G N{u)\ {vi, f 2} is then at least 
implying returns to u in expectation for every transition from u to N{u) \ 
{vi,V2}- Assuming that a move from u to Vi or V2 always results in an immediate 
return, we can bound < g^ijE^ + 0{^/ue-^^^^)) < 6 + 0{^/ue-^^^^). □ 



8.3.4 The Number of Vertices not Locally Tree-like 

We wish to bound the number of vertices v that are not locally tree-like, i.e.. 



for which has a cycle. We use the following from the proof of Lemma [60] of 
section 



7.6.2 



J'{2x) = then 
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J={dn -2x) ^ f 1 



T{9n) - \9n-2x^ 

Lemma 77. With probability at least 1 — n^^^^\ the number of vertices not 
locally tree-like is at most n^^^'^ . 

Proof The expected number of small cycles has upper bound 



^ \9n-2kJ 

I — q V / 



k=3 
2UJ+1 

< 



k=3 
2u}+l 

fc=3 
^ ^4(a;+l) 



n 



dn-Auj-2 



Therefore, the expected number of vertices within distance w of a cycle is at most 
^i{u>+i)^u, = /S.^^+^, Since A < w^/^, the lemma follows by Markov's inequality. 

□ 



8.3.5 The Probability a Vertex is Unvisited 



We use the same principles as in section 7.9.1, Restating (7.67) 



COV^iG] <t + l + 5^5^Pr(A,(i;)). 



(8.21) 



V s>t 



We use Lemmas 58 and 59 , which hold for weighted random walks (see chapter 



2, General Markov Chains, in |1] for justification of (7.31) and the inequality 
D{s + t) < 2D{s)D(t). All other expressions in the proofs hold for weighted 
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random walks). Thus, 



Pr{At{v)) < exp 



-(l-o(l))Lt/r,„J 



where = T + 2H[7r,t>] and H[7r,t>] = (1 + o{l))Rv/ny. 
Hence, for a given v, 

-(l-o(l))b/r.J 



s>t s>t ^ 



< E exp 

s>[t/r„J 



-(l-o(l))s 



< 3Ty exp 



3r^ exp 



-(l-o(l))Lt/r.J 



■(l-o(l)) 



tvr,, 



Ttt, + (1 + o(l))2i?. 



Since Ttt^ = o(l) and tt^ > l/2n from (8.18), we get 



EPr(A,(t;)) <3r,exp (- 

s>t ^ 



-(1-0(1)) 



{l + o{l))AnR, 



Let t = t* = (1 + e)8|3|nlogr;, where e — )■ sufficiently slowly. Then 



J2 Pr(A,(t;)) < 3r, exp f-{l + e(e)) 



6 — 1 log n 
(5-2 A, 



(8.22) 



We partition the double sum '^^^s>tPT^{As{v)) from (8.21) into 



5^5^Pr(A.(t;))+5^5^Pr(A.(t;)) 

v&Va s>t vGVb s>t 



where Va_ are locally tree-like and Vb are not. 
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If V is locally tree-like, then using Theorem 76 (a), the RHS of (8.22) is bounded 
by 

3r,n-(i+®(^)) = 3(T + 2(1 + o(l))i?,/7r,)n-(^+®(^)) 
< (l + o(l))12ni?,n"(^+®(^)) 
= 0(l)n-®(^) 



Thus, 



^Pr(A,(i;)) < 0(l)ni-®(^) = o{t). 



(8.23) 



For any v (i.e., including those not locally tree-like), (8.22) is bounded by 



3r„n-(^+®(^»^ < 0(l)n^-(^+®(^))5$^ 



(8.24) 



Using Lemma 77 to sum the bound (8.24) over all non locally tree-like vertices, 
we get 

XlPr(A,(t;)) < 0(l)n^+'"^'+®('))^ = 0{n^) = o{t). (8.25) 

vGVb s>t 



Hence, combining (8.21), (8.23) and (8.25) for t = t*, Theorem 70 follows. □ 



Compare this with (8.13), we see that the speed up, 

COV[G] 



s 



COY[G^ 



Therefore 5— j-ooasn— )-ooif6'— J-ooasn— )-oo. That is, we can have an 
unbounded speed up. 

We conjecture that the following tighter bound holds: 
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Conjecture 3. Equation (8.14) can be replaced by 



COViGJ < (1 + 0(1))^ nlogn. 
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Chapter 9 



Conclusion 



9.1 Main Results 

In this thesis, we have addressed the following question, first posed in the Intro- 
duction: 

For a random walk Wu on a simple, connected, undirected graph G — {V^E), 
what is the expected number of steps required to visit all the vertices in G, max- 
imised over starting vertices u ? 

We have given answers to this question for three cases: 

1. Simple random walks on graphs G of the form G — GinG2 where □ is the 
Cartesian product operator, 

2. Simple random walks on graphs G picked uar from the space of graphs 
Q (d) where d is a given degree sequence satisfying various constraints, 

3. Weighted random walks for general graphs, and graphs G picked Ucir from 
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In particular, we have proved the following theorems 
For the first case, we showed in chapter [6] 



Theorem [39] Lei F = {Vp, Ep) = GUH where G = {Vg, Eg) and H = {Vh, Eh) 
are simple, connected, unweighted, undirected graphs. We have 



COV[F] > max | (^1 + COV[H], (^1 + GOV 



[G] 



Suppose further that uh > -Dg + 1; then 



COV[F] <K[[l + ^\ BCOV[i/] + ^^omnnne 



6h J COV[H]Dg 

where M = \Ep\ = riGmH + nnfnG, i = log{DG + 1) log(nG-DG) and K is some 
universal constant. 

(We remind the reader that in the above, for a graph F, np = |y(F)|, mr = 
|-E'(F)|, Dy is the diameter of F, and 5y and are the minimum and maximum 
degrees respectively). 

For the second case, we showed in chapter [7j 



Theorem 49 Let G he chosen uar from Q{d), where d is nice. Then whp, 

COV[G]~^^%logn, 
a — 2a 

where d is the effective minimum degree and 9 is the average degree. 



Theorem [39] answers the question within bounds, whilst Theorem [49] gives a pre- 
cise asymptotic value. We reiterate some definitions: a statement V{n) parame- 
terised on an integer n holds with high probability (whp) if Pr{V{n) is true) — )■ 1 
as n — )■ oo] the notation f{n) ~ g{n) means f{n)/g{n) — )■ 1 as — > oo. 



^Theorems, lemmas and conjectures presented in this chapter are restatements of ones 
presented previously. 
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The third case was addressed in chapter [8j Each edge e is given a non-negative 
weight w{e), and the probabihty of transitioning e from vertex u is w{e)/w{u) 
where w{u) = J2e={uv)eE'^(^)- presented a simple weighting scheme, the 
minimum degree (min-deg) weighting scheme, where each edge e = {u, v) has 
w{e) = 1/ mm{d{u) , d{v)} . That is, the transition probability matrix was 



p 



otherwise 



We proved the following theorem for general graphs {G is assumed to be simple, 
connected and undirected): 



Theorem 68 For a graph G under the min-deg weighting scheme, H[m, v] < 6n 



for any pair of vertices m, f G V{G). 



Consequently, by Matthews' technique (Theorem 30 ) 
Corollary |69|COV[G'] = 0(n2 log n). 



We then studied this weighted walk on the specific class of graphs that is the 



subject of Theorem 49, applying the min-deg weighting scheme to G G ^(d). 



We presented the following theorem: 



Theorem 70 Let G be chosen uar from Q{d), where d is nice with the following 
extra restriction: A < a;3 . Weight the edges of G with the min-deg weighting 
scheme, that is, for an edge {u,v), assign it weight w{u,v) = 1/ min{d{u),d{v)}. 
Denote the resulting graph G^ Then whp, 

COV[Gj < (1 + 0(1))^ Snlogn. (9.1) 

0—2 

where 6 is the minimum degree. 

We have, by the above, extended the body of knowledge on random walks on 
graphs, in particular, the cover time of random walks on graphs. 
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9.2 Secondary Results 

Additionally, in the course of proving the above theorems, we have presented 
a number of results which may be of independent interest. Specifically, we 
have given results related to effective resistance of the Cartesian product of 
two graphs, a frame work for analysing walks on a graph by analysing "local 
observations" of the walk, and structural results for G G ^(d). 

The following lemmas were proved in chapter [6] 



Lemma 47 For a graph G and tree T, R^axiG^T) < AR^axiGOP^) where 



\V(T)\ < r < 2\V{T)\ and Pr is the path on r vertices. 



Lemma 48 For graphs G, H suppose Dq + 1 < uh < ct^Dc + 1), for some a. 



Then Rmax{GOH) < (^alog{DG + 1), where C is some universal constant. 

(We remind the reader that RmaxiG) is the maximum effective resistance between 
a pair of vertices in a graph G). 



In chapter [6| to prove the Theorem 39 we presented a framework to bound the 
cover time of a random walk W(G') on a graph G by dividing the graph up into 
regions - sets of vertices S C V{G) - and analysing the behaviour of the walk 
whilst on those regions. It does so by relating >V(G') whilst walking on S" to a 
random walk 'W{H) on where H is weighted graph derived from (G, S). The 
analysis of the VV(//) walks can then be composed over the whole graph. Thus 
the analysis of the whole graph is reduced to the analysis of outcomes on local 
regions and subsequent compositions of those outcomes. This framework can be 
applied more generally than Cartesian products. The details are presented in 
section I 



In chapter [7| we presented the following result on the conductance of a graph 
G e g{d): 



Theorem 54 Subject to assumptions (i)-(iv) listed in section 1.3.2, for a graph 
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G e g{d), <^{G) > 1/100 whp. 

9.3 Future Work 

We discuss some possible avenues for further study. 

9.3.1 Cover Time of other Random Graph Models 

Similar techniques to those in chapter [7] have been used to study the cover times 
of other models of random graphs, for example, the preferential attachment 
graph [22], the random geometric graph [27] and the giant component of the 



Erdos-Renyi random graph [23] (see section 7. 1 for a discussion of these models) . 
We envisage that the techniques can also be extended to other models of random 
graphs, for example, the random intersection graph [TU], [SS], the BoUobas- 
Chung (B-C) model [H], and the Watts-Strogatz (W-S) model [700 

We quote pUJ for the definition of the (uniform) random intersection graphj^ 

The uniform random intersection graph G{n, m, k) is a random graph 
defined as follows. Let be a set of n nodes, and let M be a set 
of m colours. To each node v & V we assign a subset C M of 
k distinct colours, chosen uniformly and independently at random 
from the fc-subsets of M. We join distinct nodes u,v & V hj a.n edge 
if and only if F„ n ^ 0. 



^The Watts-Strogatz model is motivated by the desire to find graph models whose properties 
more accurately reflect "real-world" networks such as social networks. The W-S model has 
some of these desired properties, such as small distance between nodes (hence the term small 
world model). See |7D]. The BoUobas-Chung model was motivated by the desire to create 
bounded-degree graphs with small diameter. |14j demonstrates approximately logj n diameter 
for the B-C model. 

•^ [10] motivates random intersection graphs by wireless networks, where a colour is a unique 
cryptographic key, and a pair of sensors can communicated if and only if they have a key in 
common. 
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The B-C model is a n-cycle with a random matching. In the W-S model, the 
graph begins as an n cycle with each vertex v connected to k others; the k/2 near- 
est neigbours on each side. Labelling the vertices round the cycle (0, 1, . . . , n— 1), 
for each vertex i in turn, "rewire" each edge (i, j), i < j to {i, k) where k is chosen 
uar from amongst the vertices that would avoid loops or parallel edges. 

As we envisage it, the crux of a proof, as in chapter [7], would be the application 



of Lemma |56] This would require verification that the conditions of the lemma 
hold, determining a suitable mixing time T, degree distribution of the vertices, 
and calculations for the expected number of returns to a vertex v within the 
mixing time. 

We envisage that the aforementioned models will be rapidly mixing (i.e., T = 
0(log"n) for some constant a), and that, for sufficiently sparse graphs, it will be 
possible to determine degree distributions and demonstrate tree-like local graph 
structures (which will facilitate the calculation of R^). Indeed, for the B-C 
model, each vertex has degree 3 by construction, and |32j shows T = 0(log?7,), 



thus satisfying condition (b) of Lemma 56 



9.3.2 Weighted Random Walks 

The study of weighted random walks, in particular, the cover time, is not nearly 
as developed as for the unweighted case. We reiterate the following conjecture, 
made in chapter |8| 

Conjecture g The minimum degree weighting scheme has 0{n^) cover time on 
all graphs G. 

We do not believe that "standard" techniques (for examples, those presented in 
chapter [sj and subsequent chapters) are sufficient to prove this conjecture. In 
fact, we believe that it may require new techniques or further development of 
the underlying theory of weighted random walks. 
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It may not be as demanding, however, to improve the bound of Theorem 70 
even prove the following conjecture, made in chapter [8j 

Conjecture^ Equation (8.14) can be replaced by 



or 



COV[G^] < (1 + 0(1))^ nlogn. 
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